Good practice from
the grass roots 


Community-led efforts — not just global ones — 
are key to research integrity. 


When it comes to research integrity,
scientists use the language of aspiration, 
whereas policymakers talk about hard 
rules and enforcement. 

That’s one conclusion from an in-depth 
analysis of published research and policy documents in 
research integrity (S. P. J. M. Horbach and W. Halffman Sci.
Eng. Ethics 23, 1461-1485; 2017). There are other discon- 
nects, too. Countries, disciplines and sectors often approach 
integrity in different ways. For some, it can be confined to 
preventing data fabrication, falsification and plagiarism. 
But integrity is much broader, encompassing quality and 
relevance, as well as recognition of diversity and inclusion. 

The need for a unified approach is slowly gaining rec- 
ognition. The World Science Forum, a biennial meeting 
of researchers and policymakers from different coun- 
tries, issued a declaration at its November conference 
in Budapest that called for, among other things, “har- 
monisation and enforcement of standards of conduct of 
scientific research across borders and across public and 
private research”. The declaration also supported processes
by which scientists “can report suspected research
misconduct and other irresponsible research practices, 
without fear of reprisal”, and it urged clearer procedures 
for responding to such concerns. 

These proposals echo many national and international 
guidance documents that have been produced since 2010, 
when the Singapore Statement on Research Integrity billed 
itself as the first international effort to unify approaches to 
integrity. Subsequent guidelines have included the Bonn 
PRINTEGER Statement, the All European Academies Code 
of Conduct for Research Integrity, the Netherlands Code of 
Conduct for Research Integrity and many more. 

But none yet has the globally respected status of, say, 
the 1964 Declaration of Helsinki, which is widely accepted 
as the standard for the ethical treatment of human partic- 
ipants in medical research. And that raises the question: 
how can the burgeoning official declarations actually 
enhance research integrity in the lab? 

For the working researcher, complying with a research-integrity
policy can sometimes feel like another unwelcome
burden — yet another form to fill in, set of data to record or
online system to feed. And because these policies are often 
handed down from the directors, that can make workers 
unwilling to accept that these practices are something that 
everyone benefits from and needs to engage with. 

That’s where local, community-driven efforts are 
offering a way forward. On page 183, Marcus Munafò at
the University of Bristol, UK, describes how one network
of universities and journal publishers is enabling mem- 
bers to exchange ideas on integrity, which could lead to 
consensus about practices such as promoting transpar- 
ency and data sharing. And on page 210, a multinational 
team of researchers and publishers presents a consensus 
definition of predatory journals. This is a welcome and 
genuinely collaborative effort, the result of many rounds 
of discussion between researchers, institutional leaders 
and patient representatives, among other stakeholders. 

Networks for integrity are emerging in the global south, 
too, such as the African Research Integrity Network, which
was established in 2017, and the Asia Pacific Research 
Integrity Network, set up two years earlier, in 2015. 

Perhaps some of the most exciting grass-roots efforts 
are those that connect researchers to each other and 
to policymakers. The Embassy of Good Science is one 
such platform. Established under the European Union’s 
Horizon 2020 research programme, it styles itself as a ‘public
square’ where researchers can find guidance and share
knowledge. It contains links to national ethics codes and 
articles explaining good practice. 

Official declarations about research integrity can some- 
times be perceived by researchers as statements of the 
obvious, particularly because researchers regard the 
pursuit of science itself as upholding values of fairness, 
honesty and scepticism. Policy documents, moreover, can 
be viewed warily by researchers as a list of commandments
handed down from on high, chiselled into stone tablets. 

But rules and policies are needed — and they can be most 
effective when they arise as a result of engagement with 
grass-roots communities. Local efforts might not produce 
the global unity and consistency of high-level statements, 
but they have a much higher chance of changing research- 
ers’ daily practices, and thus making a genuine difference 
to integrity in research. The lesson here is familiar: change
will come when we work alongside the communities we 
wish to change. 


Space for dialogue 


Sustainability calls for collaboration between 
architects and behavioural scientists. 


There was a time when ‘sustainable development’
meant economic development, or perpetual
economic growth — not, as we know it today,
environmentally sustainable development.

The change in meaning can be traced to the
1987 report Our Common Future, chaired by Norway’s then-prime
minister, Gro Harlem Brundtland. The report involved
social scientists, natural scientists, industrialists, environmentalists
and policymakers emerging from their silos
to talk to each other to understand how humans alter the
global environment. The report helped such collaborative
processes to become mainstream, alongside the idea of
treating the environment and development as one issue. 

Some fields quickly grasped that interdisciplinary work 
is essential to understanding environmental change, and 
to mitigating — or adapting to — its effects. Confirming a 
human cause for climate change required the combined 
efforts of meteorologists, oceanographers and geogra- 
phers, among others. Replacing the ozone-depleting chem- 
icals used in spray cans and refrigerators needed chemists 
to talk to product designers. But, as a report this week in 
Nature Sustainability shows, other fields have not got so 
far in their interdisciplinary journey (L. Klotz et al. Nature
Sustain. 2, 1067-1069; 2019). 

In a project convened by the journal and the Convergent
Behavioral Science Initiative at the University of Virginia 
in Charlottesville, an international group of architects, 
designers and engineers spent a year with behavioural 
scientists, investigating how their disciplines could better 
work together, and why they needed to do so. 

Behavioural science has an existing and essential 
relationship to the built environment: we have to study 
how people live, work and move to create liveable buildings 
and towns. But the group established that, when it comes 
to sustainability, there’s room for closer working, and the 
report amounts to an agenda for joint research. Potential 
questions include: how do architects and designers make 
decisions? To what extent can behavioural science in other 
contexts be applied to sustainable design and architecture? 
Do architects feel a duty to promote responsible energy use?

Cross-disciplinary working requires careful communica- 
tion and confidence-building. As the example of defining 
sustainable development shows, disciplines have their own 
languages and can interpret terms differently. 

Lessons in interdisciplinarity can also be learnt from the
‘science wars’ of the mid-1990s, a tense time in the relation- 
ship between natural scientists and the sociologists who 
study how research is done. Part of the ambition for soci- 
ologists of science is to place a mirror before researchers, 
to demonstrate potential flaws in their methods. But some 
eminent researchers saw these studies as an intrusion, and 
thought that natural scientists had little to learn from them. 

One way to ease disciplinary tensions could be to under- 
score that sustainability calls for behavioural change 
at all levels — necessitating more research across all 
sectors. Governments, for example, often interact with 
independent researchers who study how to improve policy,
including how government itself needs to adapt if it is to 
drive sustainability more effectively. Similarly, business 
schools produce case studies on how companies can adapt 
to facilitate that change. Behavioural research could help 
all of us — individuals and communities — to make changes
to how we behave, whether it is taking more public trans- 
port or just turning the thermostat down a degree. 

Along with governments, industry and individuals, the 
built environment consumes energy and produces waste, 
which makes it just as pivotal to sustainability. As the Nature 
Sustainability report says, collaborating effectively and 
learning from each other can be tough. But considering the 
planetary situation, not doing so has much higher costs. 
No more ‘manels’


Nature’s new code of conduct strives for more 
diversity at research meetings and events. 


That women from under-represented minorities
receive few speaking invitations to the world’s 
largest Earth-science conference has again 
shone a spotlight on science’s diversity deficit 
(H.L. Ford et al. Nature 576, 32-35; 2019). 

Conferences are essential for research communication, 
and taking part is important for career progression. But turning
the dial on diversity — and stopping it from slipping back
— is proving difficult. Our investigation this year of ‘manels’
and ‘manferences’ — panels and conferences dominated by 
male speakers — showed that sometimes a heroic effort to 
diversify them one year is followed by business as usual the 
next (Nature 573, 184-186; 2019). 

At Nature, we are aware of our own shortcomings — that 
our authors and referees, for example, include too few 
women — and of our responsibilities to turn things around 
(Nature 558, 344; 2018). This week, we begin a more con- 
certed push to promote diversity across our editorial and 
publishing activities, including concrete commitments in 
the events that we organize (see go.nature.com/36jtfr). 

In 2019, Nature and other journals in the Nature Research
portfolio hosted, or co-hosted, more than 30 events in a
range of disciplines. But despite informal efforts to make 
our conferences more inclusive, women and people from 
minority groups still make up only a small proportion of 
our speakers. We are therefore formalizing our efforts into 
a published code of conduct. This will apply not only to 
Nature Conferences but to all scholarly events organized 
or co-organized by Springer Nature. 

The code commits us to having no male-only organizing 
committees for Nature Conferences planned from this point. 
We will invite equal numbers of women and men as speakers,
whether we're selecting for keynote presentations or from
abstract submissions. We also commit to having no manels at
our events, and to monitor and report progress against these 
goals at the end of each calendar year. Planning for most of 
our events in 2020 is already advanced, so the full effect of 
our commitment will be seen from 2021. 

Nature Conferences must be welcoming, safe, 
collaborative and productive for all attendees. Our code 
states that we expect participants to be considerate of 
diverse views and cultures, and respectful and collabora- 
tive in their discussion and critiques of ideas. Appropriate 
sanctions will be applied where the code is not followed. 

We also commit to supporting diversity more broadly, 
including in geography, ethnicity, culture, career stage, 
disability and sexual orientation. With time, we aim to 
develop our code further to address this explicitly. 

Scientific events must be more inclusive. We hope that 
this initiative — like similar ones in many other organizations
— goes some way to reaching that goal. 
A personal take on science and society


World view 


By Marcus Munafò


Raising research quality will 
require collective action 



Institutions must act together to reform 
research culture, says Marcus Munafò.


Modern research is far removed from that
done by the independent scientist of
Charles Darwin’s day. But that model still
underpins what we reward. And the tension
is showing. The number of authors listed on
published papers has increased fivefold. But our system of 
incentives — accreted haphazardly as science evolved from 
hobby to career — still focuses on individuals. 

Funding, appointments, promotions, tenure, prizes and 
so on emphasize individual achievement and overlook 
deeds that benefit everyone and should be valued explicitly, 
such as producing usable tools or sharing code. If we want
to move towards a transparent model of research, we need
to reward open-research practices. If we want researchers
to work well in large collaborations, we need to train them
in communication skills and collective self-scrutiny. 

In the past five years, many studies have attempted to 
assess the prevalence of questionable research practices. 
The Netherlands has commissioned a country-wide survey. 
Others have focused on specific fields, from ecology and 
evolution to health services. Describing the scale of the 
problem is necessary, but insufficient. 

We must reflect on how (and, more importantly, why) questionable
research practices and undesirable behaviours arise
and persist. What are the flaws in our institutions’ cultures 
and practices that allow this conduct to proliferate? We must 
find the root causes. Have we disincentivized solid, cumu- 
lative work, replication studies and publishing null results 
— essential if science is to self-correct efficiently — with our 
relentless focus on groundbreaking findings? 

That self-inspection is in the air. Institutions are commit- 
ting to working together to determine how their cultural 
practices, such as emphasizing the importance of novelty, 
discovery and priority, undermine the value of replication, 
verification and transparency. That is the goal of the UK 
Reproducibility Network, which I co-founded earlier this 
year. It started as informal groups of researchers at individ- 
ual institutions that met with representatives from funders 
and publishers (including Nature) who were open to dis- 
cussions about how best to align open-science initiatives 
— reproducibility sections in grant applications and reporting
checklists in article submissions, for example. Now
institutions themselves are cooperating to consider larger 
changes, from training to hiring and promotion practices. 

Our ten university members span the United Kingdom 
from Aberdeen to Surrey, and we expect that list to grow. 
Each will appoint a senior academic to focus on research 
quality and improvement. Figuring out which system-level 
changes are needed and how to make them happen will now
be someone’s primary responsibility, not a volunteer activity.
What changes might ensue? Earlier this year, the University of
Bristol, where I work, made the use of data sharing and other
open-research practices an explicit criterion for promotion. 

But one institution will make little difference on its own. 
For better practices to become the norm, many universities 
need to act collectively. Changes to incentives at a single institution
will not make new behaviours stick, not least because
practices required in only one place can act as a career tax on
its scientists. Only if changes occur across many institutions 
will the impacts permeate scientific culture. 

The same is true for training. If universities agree that all 
their graduates must reach common standards — in data 
skills, for example — future scientists will do better research 
and collaborate across institutions in a way that is hard to 
imagine today. Consider the productivity gains if compe- 
tence in the programming language R — which can be used 
to run statistical analyses across a broad range of areas — was
a given. Research would become more efficient, like the railways
did after adopting a common standard for track gauge.

Public-health campaigns to reduce smoking demonstrate 
how coordinated action can yield broad cultural change. 
In 1974, 45% of the UK population smoked; now it is below 
15%. This was not the result of any one intervention, but a 
coordinated programme of taxes, bans in the workplace and
public spaces, and advertising restrictions. Gradually, the 
‘normal’ behaviour changed. Someone wishing to smoke 
today would probably ask their companions for permission; 
a generation ago, they would have offered them a cigarette.

When it comes to changing the culture of science, the 
UK Reproducibility Network is not alone. Numerous ini- 
tiatives now link members of the research community to 
support robust, transparent research. Examples include 
the Center for Open Science in the United States (founded 
in 2013), the QUEST Center in Germany (founded in 2017), 
the Research on Research Institute with eight participating 
countries, launched this year, and a proliferation of grass-roots
networks of researchers in many countries.

But these cultural changes might falter. Culture eats 
strategy for breakfast — grand plans founder on the rocks
of implicit values, beliefs and ways of working. Top-down 
initiatives from funders and publishers will fizzle out if they
are not implemented by researchers, who review papers 
and grant proposals. Grass-roots efforts will flourish only 
if institutions recognize and reward researchers’ efforts. 

Funders, publishers and bottom-up networks of 
researchers have all made strides. Institutions are, in many 
ways, the final piece of the jigsaw. Universities are already 
investing in cutting-edge technology and embarking on 
ambitious infrastructure programmes. Cultural change 
is just as essential to long-term success. 
The world this week


News in brief


SCIENCE-FUNDING PLEDGES 
AHEAD OF ‘BREXIT ELECTION’ 


As Nature went to press, the 
United Kingdom’s three main 
political parties had each 
pledged to increase spending 
on science ahead of the
12 December general election, 
which was largely overshadowed 
by the country’s 2016 vote to 
leave the European Union. 
Conservative Prime Minister 
Boris Johnson called the election 
after Parliament was unable to 
agree on how to move forward in
the wake of the Brexit vote. 
During the campaign, the 
three main parties diverged 
widely on their plans for the 
impending divorce, with the 
Conservatives vowing to take 
the country out of the bloc by 
31 January, Labour promising a 
second ‘people’s vote’ with an 
option to remain in the EU and 
the Liberal Democrats vowing to 
stop Brexit altogether. 
The spectre of Brexit 
“looms large over science”, 
says James Wilsdon, director 
of the Research on Research 
Institute at the University of 
Sheffield, UK. As well as its wide 
political and economic effects,
Brexit’s changes to freedom of
movement and participation 
in EU programmes could have 
major impacts on scientists. 

But although Brexit was the 
dominant campaign issue, the 
Conservatives, Labour and 
the Liberal Democrats had all 
promised in their manifestos to 
invest more money in research. 

The United Kingdom currently 
spends about 1.7% of its gross 
domestic product (GDP) on 
research and development, 
totalling about £7.5 billion 
(US$9.8 billion) per year. 

The Conservative Party — 
whose manifesto mentioned the 
words ‘research’ and ‘science’ 
more times than ever before, 
and more than those of the other 
two main parties (see ‘Political 
push’) — was pressing for the 
lowest increase. It pledged to 
spend 2.4% of GDP on science. 

The other two parties were 
targeting 3%, with Labour 
pledging to hit the goal by 
2030 and “create an innovation
nation”. The Liberal Democrats
promised an “interim target” of 
2.4% by 2027. 


POLITICAL PUSH 


Mentions of science in the Conservative Party 
manifesto are at their highest in two decades. 


[Chart legend: Conservative, Labour, Liberal Democrat. Axis label (partly lost): … or ‘scien*’ in manifesto.]
INDONESIA FIRE
ANALYSIS GETS 
HEATED 


How much did Indonesia burn 
this year? An international 
research organization has taken 
down an online report that 
suggested fires burnt more than 
1.6 million hectares of land in 
the country in 2019, 40% more 
than the government calculated 
for the same period. 

The Indonesian government 
had criticized the analysis, by 
the Center for International 
Forestry Research (CIFOR), 
saying that it relied on satellite 
data that hadn’t been confirmed 
with ground observations. 

Raffles Panjaitan, director 
of forest and land-fire 
management at the Ministry
of Environment and Forestry
in Jakarta, says the ministry
used satellite data and ground 
observations to calculate that 
just under 950,000 hectares 
were burnt between January and 
October. The ministry relies on 
imagery from NASA’s Landsat 8 
Earth-observation satellite. 

CIFOR, based in Bogor, 
Indonesia, used time-series 
imagery from the European 
Space Agency’s Sentinel-2 
Earth-observation satellites. On 
6 December, the group removed 
the analysis from its blog, 
saying it should have had it peer 
reviewed before posting. 

Three days earlier, CIFOR 
landscape ecologist David 
Gaveau had told Nature that 
the discrepancies might stem 
from differences in the satellite 
imagery the two groups used. 
SPACE TELESCOPE
HERALDS NEW ERA OF 
EXOPLANET SCIENCE 


The European Space Agency 
is set to launch an exoplanet 
telescope that will perform 
detailed studies of hundreds 
of known worlds beyond the 
Solar System. Called CHEOPS 
(Characterising Exoplanet 
Satellite), the €50-million 
(US$55-million) spacecraft 
is scheduled for launch from 
Kourou, French Guiana, on 
17 December. 

Equipped with a single
camera, CHEOPS will peer at 
stars already known to host 
exoplanets. By observing the dip 
in the brightness of a star’s light
as a planet passes in front of it, 
astronomers will work out the 
sizes of these worlds and study 
some of their atmospheres, 
providing crucial information 
on the formation and evolution
of a variety of exoplanets. Over 
its 3.5-year scientific mission, 
which will begin in April 2020, 
the telescope will study between 
300 and 500 worlds. 

The telescope marks a 
departure from previous 
exoplanet missions because it 
is the first designed to study — 
rather than merely find — alien 
worlds. “We’re moving from 
discovery to characterization,” 
says Kate Isaak, a project 
scientist on the mission at 
the European Space Research 
and Technology Centre in 
Noordwijk, the Netherlands. 


A cave painting in Indonesia portraying a hunt might be
the oldest recorded story in human history. The scene —
discovered in 2017 and now dated to at least 43,900 years
ago — is more than 20,000 years older than depictions
of clear narrative scenes in cave art from Europe, such
as those found in the Lascaux Cave in France. One panel
(pictured) from the Indonesian cave on Sulawesi shows
a buffalo called an anoa being hunted by smaller figures
that might be human-animal chimaeras. Such figures,
also known as therianthropes, point to a human capacity
to imagine and depict objects that do not exist in the 
natural world, says Adam Brumm, an archaeologist at 
Griffith University in Brisbane, Australia, whose team 
described the art (M. Aubert et al. Nature
https://doi.org/10.1038/s41586-019-1806-y; 2019). Previously, the
oldest-known therianthrope was a 40,000-year-old
human-lion ivory statue from Germany. The Sulawesi 
cave and its paintings were found by a local archaeologist 
and caver named Hamrullah. “We've seen hundreds of 
rock art sites in this region but we’ve never seen anything 
like a hunting scene,” adds Brumm.


NASA IMAGES REVEAL
CRASHED INDIAN
MOON LANDER


Images from NASA’s Lunar 
Reconnaissance Orbiter have 
been used to confirm the final 
resting place of India’s Moon 
lander, Vikram. The country’s 
space agency lost contact with 
the craft just moments before it 
was supposed to touch down on 
7 September. 

A few days later, the Indian 
Space Research Organisation 
said it had located the lander, 
but did not release any images of 
the site. 

Just over two weeks after the 
mishap, the team that manages 
the Lunar Reconnaissance 
Orbiter’s camera posted 
on its website an image of
Vikram’s intended landing 
site near the lunar south pole. 

It was then contacted by an 
engineer in Chennai, Shanmuga 
Subramanian, who had spotted 
features in the image that looked 
like debris. 

The team was able to confirm 
that the features were debris 
from the craft by comparing 
a more detailed image taken
before the landing with 
photos taken after. The photos 
reveal a large crater from the 
main impact and a trail of 
debris spread across several 
kilometres. 

Vikram was supposed to 
touch down and explore the area 
nearby. 

The craft was India’s first 
attempt to ‘soft’ land on 
the Moon, as part of the 
Chandrayaan-2 mission. 
The world this week


News in focus 
Plato (left), Buddha and Confucius (right) were key players in the emergence of an enlightened era, according to philosopher Karl Jaspers.


WHEN DID SOCIETIES BECOME 
MODERN? ‘BIG HISTORY’ 
DASHES IDEA OF AXIAL AGE 


Humanity’s supposed singular transition to modernity in the first millennium BC 
was much messier than previously thought, finds sweeping study of historical data. 


By Laura Spinney 


It’s an idea that has been influential for
more than 200 years: around the middle
of the first millennium BC, humanity
passed through a psychological watershed
and became modern. This ‘Axial
Age’ transformed an archaic world of divine 
rulers, slavery and human sacrifice into a more 
enlightened era that valued social justice, 
family values and the rule of law. The appeal 
of the general concept is such that some have 
claimed humanity is now passing through a 
second Axial Age — driven by rapid population 
growth and technological change. Yet accord- 
ing to the largest-ever cross-cultural survey of 
historical and archaeological data, the first of 
these ages never happened. 


Major changes did take place in how humans
understood their place in the Universe and 
their relationships with each other, finds the 
analysis. But sometimes these societal shifts 
happened earlier than the first millennium BC, 
and sometimes later. And they did not always 
occur in the societies typically considered 
‘axial’ — what is now Greece, Israel-Palestine, 
Iran, India and China — although they did take 
place in some other civilizations. “We couldn't
find any consistent Axial Age that was confined 
to those five societies,” says Jenny Reddish, 
an anthropologist at the Complexity Science 
Hub in Vienna and one of the survey’s authors. 

The work, published this week as a 500-page 
book entitled Seshat History of the Axial Age, 
spotlights the ‘big data’ approaches to history 
that have become popular in the past decade. 


© 2019 Springer Nature Limited. All rights reserved. 


These can complement the very specialized, 
detailed work of standard historians with a
broad-brush, comparative approach to the 
evolution of societies that are widely sepa- 
rated in time and space. The current finding 
is likely to be followed by many other studies 
that address the origins of complex societies 
using these new techniques. 


Lone innovation? 

Although the Axial Age concept dates back to 
the eighteenth century, its best-known pro- 
ponent made his case in the 1940s. In 1948, 
German-Swiss philosopher Karl Jaspers 
wrote that between 800 and 200 BC, the five 
aforementioned societies independently 
embraced moral universalism, or the idea 
that people are morally bound to each other 
by their common humanity. This step, Jaspers
argued, was prompted by the teachings of lone 
cultural innovators — Plato, the Hebrew proph- 
ets, Zarathustra, Buddha and Confucius — and 
initiated a trend that eventually touched the 
entire globe. 

The latest work challenges this idea. It 
builds on a survey published in 2018 and led by
anthropologist Daniel Mullins at the University
of Oxford, UK, and both draw on a pioneering
historical and archaeological database called 
Seshat — after the ancient Egyptian goddess of 
recordkeeping. The 2018 study reported that 
the picture of a simultaneous axial shift was 
much less clear than scholars had suggested 
(D. A. Mullins et al. Am. Sociol. Rev. 83, 596-626;
2018). Its sample comprised the five socie- 
ties mentioned by Jaspers and five others the 
authors had selected, which thrived between 
3000 BC and AD 2000 — in modern-day Italy,
Turkey, Egypt, Cambodia and Japan. 

The latest survey — conducted by some of 
the same scholars — expands the sample to 
include societies in southeast Asia, Polynesia, 
West Africa and North and South America during
the same period. The work, which has not
been peer reviewed, comes to the same conclusion
as the 2018 analysis: it shows “beyond
reasonable doubt, that the pattern we need to 
explain is much messier than Jaspers’ model of 
a moral and intellectual revolution in first-millennium
BCE Eurasia”, says Ian Morris, a historian
and archaeologist at Stanford University
in California, who was not involved in the sur- 
vey but wrote the book’s foreword. 


Digging for data 


Many historical databases are now under 
development, but Seshat is among the oldest, 
the most ambitious and the fastest growing. 
Founded in 2011 by historians, anthropolo- 
gists and mathematicians and funded by UK 
and European Union research grants, among 
other sources, it now stores information on 
more than 450 societies going back as far as 
4000 BC. Research assistants input data from
primary and secondary historical sources using 
a formula that allows for comparison across 
time and space while factoring in disagreement 
or uncertainty. Specialists — such as historians
and archaeologists — then validate the work. For
the latest survey, the researchers drew up a list
of 12 proxy measures that scholars widely agree
on for components of ‘axial transformation’ in a
society. These include the presence of a formal
legal code, belief in an all-knowing supernatural
being and the existence of full-time bureaucrats 
who held rulers accountable. The researchers 
then tracked the proxies through time and 
across the expanded list of societies. 

Their findings throw up a wide spectrum 
of times and places for the emergence of 
axial features. There has long been debate as 
to whether Egypt — a precociously sophisti- 
cated society — should have been included 
ANCIENT TO MODERN

Historians have long suggested that an ‘Axial Age’ —
a supposed simultaneous shift to modernity in several
ancient societies — occurred some time in the first
millennium BC, but their estimates of when this
happened vary widely. Now, a team of scholars
suggests that there was no such neat transition.

[Chart: proposed axial age and mean of published estimates, on a timeline running from 1500 BC into the AD era. Estimates published from 1873 to 2016.]


“The disagreement is less 
about the timing, the place, 
than about the causal 
mechanism.” 


in Jaspers’ original list, but the survey found 
clear evidence of axiality there, long before 
Jaspers’ ‘age’. “Starting around 1200 BCE, inthe 
Ramesside period, you see lots of changes in 
religious practice,” says Joe Manning, a histo- 
rian at Yale University in New Haven, Connecti- 
cut, who co-authored the Egypt chapter of the 
study. “It has been called the age of personal 
piety.” In Anatolia, meanwhile, the Hittites 
were applying the rule of law universally in 
the second millennium BC. By contrast, China, 
although included in the original group of 
axial civilizations, didn’t pivot until several 
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centuries after 200 BC, the latest findings 
suggest. 

According to the Seshat team, the data also 
clearly undermine another of Jaspers’ key 
claims: that innovation arose independently in 
the five core societies, which he referred to as 
“islands of light”. These societies were engaged 
in a “ton of cross-cultural exchange”, says 
historian and Seshat project manager Daniel 
Hoyer at George Brown College in Toronto, 
Canada. “The Rabbinical tradition and even 
Plato’s writings aren’t really conceivable with- 
out the Zoroastrianism and Egyptian moral 
ideals and Hittite legalism that went before.” 


Differing definitions 


The study has its critics. Hans Joas, a sociol- 
ogist of religion at the Humboldt University 
of Berlin, says that the authors distort the 
existing literature to make their point. “It’s 
unfortunate that people still use the term 
‘Axial Age’ as if it mainly referred to a myste- 
rious simultaneity,” says Joas. “That claim has 
long been given up.” 

Existing definitions of the Axial Age differ 
widely (see ‘Ancient to modern’), however, and 
the current survey strengthens others’ suspi- 
cion that the concept has served its purpose. 
“It was useful initially for signalling that antiq- 
uity was not one homogeneous period, that 
there was important evolution between the 
archaic world and late antiquity,” says Nicolas 
Baumard, who studies cultural evolution at 
the École Normale Supérieure in Paris. But
according to him, the debate has moved on 
— and the Seshat survey vindicates that. Now, 
says Baumard, “the disagreement is less about 
the timing, the place, than about the causal 
mechanism”. 

The survey is unlikely to be the last word on 
the Axial Age, if only because psychological 
states are hard to quantify. There is also an 
ongoing and at times vitriolic debate about 
the best way to build historical databases. 
For example, a rival to Seshat, the Database
of Religious History, gets specialists such as
historians to input the data, rather than just
oversee non-specialists’ efforts.

Such rifts notwithstanding, says Morris, 
big-data approaches open up the exciting 
possibility of testing different theories about 
what caused axial-type changes. Were they, 
as he, Baumard and others have proposed, a 
consequence of increasing affluence? Or were 
they, as the Seshat group contends in its latest 
work, a way of maintaining social cohesion as 
societies became more complex? 

The book’s claims might have implications
for suggestions that humanity is in, or
approaching, a new axial age. Morris says such
claims aren’t surprising given the speed with
which technology is transforming societies,
but the nature of any new axial age is every bit
as fuzzy as that of the first, as revealed by the
Seshat survey.


SOURCE: D. A. MULLINS ET AL. AM. SOCIOL. REV. 83, 596-626 (2018).
The earthquake that devastated San Francisco, California, in 1906 arose from the San Andreas fault.


TWO BIG QUAKE FAULTS MIGHT BE LINKED


Analysis suggests that quakes on the Cascadia fault 
off California can trigger shaking on the San Andreas. 


By Alexandra Witze 


Two of North America’s most fearsome
earthquake zones could be linked.
A controversial study argues that at
least 8 times in the past 3,000 years,
quakes made a one-two punch off the
west coast of the United States. A quake hit 
the Cascadia fault off the coast of northern 
California, triggering a second quake on the 
San Andreas fault just to the south. In some 
cases, the delay between the quakes might 
have been decades long. 
The study suggests that Cascadia, which 
scientists think is capable of unleashing a
magnitude-9 earthquake at any time, could
set off quakes on the northern San Andreas, 
which runs under the San Francisco Bay Area. 
Several earthquake scientists told Nature that 
more work is needed to confirm the provoca- 
tive idea. Researchers have long considered 
the two faults seismically separate. 

Chris Goldfinger, a geologist and palaeoseismologist
at Oregon State University in Corvallis,
will present the findings on 13 December at
a meeting of the American Geophysical Union
in San Francisco. “This is mostly a circumstantial
case,” he says. “I don’t have a smoking gun.”

Goldfinger and his colleagues first sug- 
gested in 2008 that earthquakes in the 
southern part of Cascadia could trigger quakes
on the northern San Andreas (C. Goldfinger 
et al. Bull. Seismol. Soc. Am. 98, 861-889; 
2008). The scientists reported finding layers 
of churned-up, sandy sediment in sea-floor 
cores drilled offshore. These layers, called 
turbidites, usually form when earthquakes 
cause underwater landslides. The researchers
found turbidites in Cascadia that seemed
to form just before similar turbidites near the
San Andreas, perhaps as a Cascadia quake
triggered a San Andreas one.

But it was hard to pinpoint exactly when the 
turbidites had formed, and Goldfinger knew 
he needed more evidence. Now he has data 
from seven cores drilled offshore in south- 
ern Cascadia and seven cores drilled near 
the northern San Andreas. The two sites are 
around 100 kilometres apart — close enough 
to feel shaking from both faults. 

At eight places in both sets of cores, Gold- 
finger spotted unusual, two-layered turbidites 
and realized that they were telling him some- 
thing new. The two-layered turbidite “has to 
be two quakes recorded together”, he says. As 
Goldfinger sees it, a Cascadia quake shook the 
coastline first, causing landslides that show up 
in both sets of cores as the first layer of turbidites.
Then, at some later point, the northern
San Andreas also shook, causing the second 
turbidite layer to form. 

“This story is pretty convincing,” says 
Jason Patton, an engineering geologist with 
the California Geological Survey in Sacra- 
mento who was a co-author on the 2008 
paper. “Cascadia turbidites are covered by 
San Andreas turbidites, so the Cascadia tur- 
bidites were deposited first.” 

Others are reserving judgement. Turbidi- 
tes show that the ground shook at some point 
in the past, but it’s difficult to tell exactly 
when or where those quakes happened, 
says Joan Gomberg, a seismologist at the US
Geological Survey in Seattle, Washington.
“All this uncertainty leaves multiple, equally 
plausible interpretations on the table — most 
of which are not sensational,” she says. 

Ross Stein, a seismologist with the earth- 
quake-preparedness firm Temblor in Redwood 
City, California, wants to see detailed mod- 
elling of how stress from the Cascadia fault 
might be transferred to the northern San 
Andreas. Scientists generally agree that a large
earthquake can sometimes trigger another on
a nearby fault. But it’s not clear whether that
might happen between southern Cascadia and 
the northern San Andreas, Stein says. 

This week at the conference, Goldfinger 
says, “I’m just going to lay out the case.”


PUBLISHERS REVIEW RESEARCH ON CHINESE MINORITY GROUPS


Springer Nature and Wiley have concerns about the 
ethics of papers on genetics and facial recognition. 


By Richard Van Noorden & 
Davide Castelvecchi 


Two science publishers are reviewing
the ethics of research papers in which
scientists backed by China’s government
used DNA or facial-recognition
technology to study minority groups
in the country, such as the predominantly
Muslim Uyghur population.

Springer Nature (which publishes Nature) 
and Wiley want to check that the study partici- 
pants gave informed consent, after researchers 
and journalists raised concerns that the papers 
were connected to China’s heavy surveillance 
operations in the northwestern province of 
Xinjiang. China has attracted international 
condemnation, and US sanctions, for mass
detentions and other human-rights violations
in the province. The Chinese government says
it is conducting a re-education campaign to
quell what it calls a terrorist movement.


Officers patrol in China's Xinjiang region, where there have been mass detentions.


192 | Nature | Vol 576 | 12 December 2019


© 2019 Springer Nature Limited. All rights reserved.

“We are very concerned about research 
which involves consent from vulnerable pop- 
ulations,” says a spokesperson from Springer 
Nature (Nature’s news team is editorially 
independent of its publisher). 

The publishers’ announcements, which The 
New York Times reported on 4 December, fol- 
low rising concerns about the publication of 
such work. Last week, Yves Moreau, a computational
biologist at the Catholic University of
Leuven in Belgium, wrote an opinion article in 
Nature warning of the dangers that accompany 
the proliferation of DNA profiling and calling 
for all unethical work in biometric research 
to be retracted. 

Springer Nature said that it would add notes
of concern about consent to two papers that
reported studies using DNA from hundreds
of Uyghurs to predict height or facial shape.
One, published in Human Genetics, was highlighted
in a separate New York Times article
that described worries that the participants
hadn’t given informed consent.

Both papers state that volunteers gave 
consent, and that the studies were approved 
by an ethics committee from the Institute 
of Forensic Science, which is affiliated with 
China’s police and security authority. 

“We are ordinary forensic scientists who
carry out forensic research following the
scientific research ethics norms,” said Caixia
Li of the Institute of Forensic Science in Beijing,
a co-author of both papers, in an e-mail to
Nature’s news team. He said that “all individuals
provided written informed consent”.

Moreau says that it’s hard to see how Uyghur 
peoples could give free, informed consent to 
DNA or facial-recognition work — given that so 
many people in that ethnic group have been 
sent to internment camps (which China calls 
education facilities). 

Springer Nature has identified a number 
of other ‘papers of concern’ published by its 
journals, the spokesperson adds, which are 
being investigated. And it has updated its 
guidance about the need to gain explicit and 
informed consent in studies that involve clini- 
cal, biomedical or biometric data from people. 

Moreau says: “Expressions of concerns are a
welcome first step, but this is only meaningful 
if it is the start of a large-scale ethical review 
of all forensic population-genetic research 
on Chinese populations and of all biometric 
research.” 

Wiley, meanwhile, said it was opening a for- 
mal investigation into an article that described 
an analysis of a database of photos of Uyghur, 
Tibetan and Korean people using various 
facial-recognition algorithms. In September,
four researchers, including Moreau and Jack 
Poulson of the advocacy group Tech Inquiry 
in Toronto, Canada, had asked Wiley to retract
the paper because of the potential for abuse of
facial-recognition technology, and the “racial
overtones of the authors’ language”. One of the
paper’s co-authors is affiliated with Curtin University
in Perth, Australia, which this month
also requested that Wiley review the paper.


A spokesperson for Wiley said that the
publisher was now re-evaluating the paper,
following an initial review that found the 
journal had followed the existing guidelines. 
In their September letter, the researchers 
noted that, according to guidelines laid down 
by the Committee on Publication Ethics, a 
London-based publisher-advisory body, 
papers can be rejected on ethical grounds 
even if they come with approvals from an eth- 
ics committee (as the Wiley paper did). The 
guidelines also say that journals should take
extra care when the research is conducted
on “vulnerable groups”, which Springer Nature
has emphasized in its latest editorial policies.


Widespread concerns 


There are numerous papers that report the use 
of biometric technology to study Uyghur and 
other minority ethnic groups in China. Moreau
wrote in his opinion article that he had identi- 
fied 40 articles co-authored by members of the 
Chinese police in 3 leading forensic-genetics 
journals — published by Springer Nature and 
by Elsevier — that describe the DNA profiling of 
Tibetans and people from Muslim minorities. 
A spokesperson for Elsevier said that the 
company is producing more comprehensive 
guidelines for the publication of genetic data, 
but that it was “unable to control the potential 
misuse of population data articles” by third 
parties after publication. 

Springer Nature also said that it would 
tighten its oversight of the academic-confer- 
ence proceedings that it publishes. Journalists 
have previously raised concerns about numer- 
ous conference papers that describe studying 
Uyghur groups, including manuscripts from a
biometrics conference held in Xinjiang in 2018
that Springer Nature published as a book.

The Institute of Electrical and Electronics 
Engineers (IEEE) has also published conference 
proceedings describing facial-recognition 
analyses in Uyghur populations (see, for 
example, refs 6 and 7). “IEEE is committed to 
reviewing our policies to ensure more consist- 
ent application of this process across the full 
range of IEEE publications,” IEEE president and 
chief executive José Moura wrote in an e-mail. 
THE HUNT FOR ICE THAT WITNESSED WEST ANTARCTICA’S COLLAPSE


Ice at Hercules Dome site could reveal how 
susceptible the region is to warming. 


By Jeff Tollefson 


Sometime this month, scientists in
Antarctica plan to start up their snowmobiles
and begin radar surveys of
a thick ridge of ice called Hercules
Dome. The dome, which sits 400 kilometres
from the South Pole, between East and
West Antarctica, could provide crucial clues
to the future of the continent’s vast ice sheet.

The surveys are intended to guide the 
drilling of the United States’ next deep 
ice core. Glaciologists hope to retrieve a 
detailed climate record of a period 116,000 to
130,000 years ago, when temperatures as little 
as 1°C warmer than today’s are thought to have 
driven the collapse of the ice covering West 
Antarctica. 

A better understanding of what happened 
then could help scientists to predict the 
behaviour of West Antarctica as climate 
change intensifies. The pace at which the 
region’s glaciers are flowing to the sea has 
increased in recent years, and many scientists 
fear that rising temperatures have triggered 
runaway melting. West Antarctica’s ice 
contains enough water to raise sea levels by 
more than 3 metres. 
“If the West Antarctic ice sheet collapsed,
Hercules Dome would basically be waterfront
property,” says Eric Steig, a glaciologist at the
University of Washington in Seattle who is 
leading the project. He is hopeful that scien- 
tists will be able to see the signal of the region’s 
long-ago collapse in ice from the dome. 

The US National Science Foundation (NSF) 
has given Steig and his colleagues nearly
US$630,000 to conduct radar surveys of ice
depth and structure across Hercules Dome.
That work started in January. The $1.5-million
drilling project could begin as soon as 2022, 
pending the agency’s approval. 

“The Hercules structure will have served as 
witness to what the atmosphere and what the 
oceans were doing when the West Antarctic 
ice sheet collapsed,” says Mary Albert, a gla- 
ciologist at Dartmouth College in Hanover, 
New Hampshire, and head of the US Ice Drilling 
Program, which advises the NSF.

Bad weather and unexpectedly rough terrain 
made it difficult to land the NSF’s Hercules 
cargo plane at Hercules Dome last year, limit- 
ing scientists’ ability to conduct radar surveys. 
Nonetheless, the University of Washington 
team managed to identify two potential drill- 
ing sites where the depth of the ice ranges from 
1.6 to 2.8 kilometres, says glaciologist Knut 
Christianson, who is leading the radar surveys.

This year, he and his colleagues plan to
spend around 30 days collecting data,
helped along by a newly groomed ice runway.
Christianson’s team is looking for ice that is 
solidly frozen all the way to Antarctic bedrock. 
That increases the odds that the researchers 
can recover air bubbles trapped in the oldest 
layers of snow and ice, which accumulated 
while Earth was cycling through ice ages. 

These air bubbles can reveal how the levels 
of greenhouse gases, trace gases and aerosols 
in the atmosphere changed from decade to 
decade, helping researchers to reconstruct 
past climate. Data from ancient ice at Hercu- 
les Dome could help to reveal whether the 
West Antarctic ice sheet was intact 115,000 to 
130,000 years ago, during a brief interlude 
between ice ages. 

There is evidence that sea levels during 
that period were up to 9 metres higher than 
today’s, a phenomenon that Steig says is hard
to explain without the loss of West Antarctica’s
ice. But hard evidence for the ice sheet’s
collapse has been tough to come by.


A view of Antarctica’s vast ice sheet.


Cold case 


Glaciologists had hoped to capture that 
ancient warm period in an ice core that they 
finished drilling in West Antarctica in 2011 (see 
‘Climate clues’). But the core they extracted
dated back only 68,000 years, possibly
because the older ice had melted away. Mod- 
els suggest there is little or no melting in the 
deepest ice layers at Hercules Dome, which are 
likely to be at least 150,000 years old. 

“There may well be ice that is two million 
years old at the bottom,” Steig says. “I wouldn’t
be surprised.” But it’s not clear how much 
information scientists can extract from such 
ancient, and often very compressed, ice. 

One of the biggest questions at Hercules 
Dome is how the weather there changed when 
the climate warmed 130,000 years ago, says 
Jeffrey Severinghaus, a palaeoclimatologist 
at the Scripps Institution of Oceanography 
in La Jolla, California. Climate models sug- 
gest that the collapse of West Antarctica’s ice 
changed air circulation across the region, cre- 
ating stormier conditions at Hercules Dome. 

Severinghaus’s research has shown that the 
prevalence of isotopes of krypton and other 
trace gases deposited in ice varies with atmospheric
pressure. This suggests that scientists
should be able to see evidence of storms in ice
from Hercules Dome. 

Drilling there could provide an unprece- 
dented opportunity to determine what hap- 
pened during a crucial period of Antarctica’s 
history, Severinghaus says. “This is a good 
analogue,” he adds, “for where humanity is 
headed right now.” 
Feature


THE CAUSATION DETECTOR


A technique called Mendelian randomization 
has become the go-to for drawing lessons 
from epidemiological data. But are scientists 
overdoing it? By David Adam 


In 1812, the British ophthalmologist
James Ware relayed a curious finding
to the members of the Royal Society in
London. Of thousands of young men
recruited to regiments of the British
army, only six had been turned away
for poor vision in 20 years. But up to
one-quarter of students about the same
age going to the University of Oxford, UK,
relied on a hand glass or spectacles.

Ware didn’t draw any conclusions about 
cause and effect: that poring over books 
might contribute to poor eyesight, for exam- 
ple, or that the bespectacled are naturally 
drawn to academic pursuits. And just as well. 
Epidemiologists have long been frustrated
by observations that link environmental 
exposures and health. Myopia is a classic 
example. Decades of studies show that chil- 
dren who spend the most time at school have 
the worst eyesight. But the data don’t reveal 
whether schooling makes children myopic 
or whether myopic kids spend more time at 
school. Or whether something else, such as 
socio-economic status, drives both. 

Fed up with this logical cul-de-sac, by the
turn of this century some epidemiologists had
begun suggesting that their field should call
it a day. Advances in genetics, they said, could
do a better job.
They were half-right. Two decades on,
genetics has transformed how people untangle
correlation from causation. But it has come
to raise epidemiology, not bury it. Genetic
differences, it turns out, can help remove confounding
variables from analyses, by standing
in as proxies for environmental exposure. The
technique is called Mendelian randomization.

Scientists have used it to re-evaluate observational
data and draw fresh, firmer conclusions
on long-standing questions of cause and effect.
The analyses have affirmed that low cholesterol
levels do not cause cancer, for example, that
drinking small amounts of alcohol does not
protect the heart and that, yes, schooling
can make children short-sighted.

“Mendelian randomization in principle is a 
really, really cool idea. It attempts to solve one 
of the most daunting challenges in epidemiology,”
says Philipp Koellinger, a social-science
geneticist at the Free University of Amsterdam. 


Gathering momentum 


George Davey Smith, a clinical epidemiologist 
at the University of Bristol, UK, who helped to 
pioneer the technique, says: “It came about 
because we were getting desperate and looking
for ways of getting better causal inference
in epidemiology.” But, he says, there is a downside,
too. “The issue is that it became very
simple to do.” 

He has been urging colleagues not to get 
carried away with Mendelian randomization. 
It’s a powerful tool, but one that must be 
used properly. As genetic data have piled up, 
a flurry of Mendelian randomization studies 
have emerged that don’t make the grade. Some 
have relied on misleading data, and others 
have failed to sufficiently test the assumptions 
on which Mendelian randomization relies. It’s
time, many in the field say, to tighten things up.

Davey Smith was one of the scientists who 
suggested that epidemiology might have run 
its course. Writing in an editorial in the International
Journal of Epidemiology, he and a
co-author pointed out that the observational
data on the possible harm or benefits of environmental
exposures would repeatedly fail
when interventions were tested in randomized
controlled trials.

A few years after the article was published,
that point came through loud and clear in the
high-profile failure of a US$100-million trial
called SELECT, which found that eating selenium
supplements did not protect against
prostate cancer, despite mountains of epidemiological
evidence suggesting that it would.

“It was all rather depressing, and the
reputation of epidemiology was coming under
scrutiny,” Davey Smith says. Researchers had
suggested as early as 1986 that genetics could
improve the interpretations. But it took the 
growth of genome-wide association studies 
(GWAS), which link genetic variants to specific 
traits, for the approach to gain traction. Last 
year, Davey Smith turned to Mendelian ran- 
domization to revisit the selenium-prostate 
cancer connection. 

Using genotype data for tens of thousands 
of men, the researchers found almost a dozen 
genetic variants that were associated with naturally
higher levels of selenium in the blood.
From birth, these people had lived as if they 
were taking selenium supplements. The sci- 
entists could then compare the incidence 
of prostate cancer in people with these variants
to that in a control group without them.
That allowed the researchers to focus more 
squarely on selenium levels and to ward off the 
influence of lifestyle factors, such as a healthy 
diet, that might influence both selenium levels 
and cancer risk. And, because the tendency 
to have high or low selenium levels was fixed 
in DNA, the analysis was less troubled by the 
likelihood of reverse causation: the possibil- 
ity that early stages of prostate cancer might 
influence selenium levels. 
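
The logic of that comparison can be sketched with a small simulation. The cohort below is entirely invented, as are the names `simulate_cohort`, `naive_estimate` and `wald_estimate`; it is not the researchers' data or code. It assumes a hidden lifestyle factor that raises the exposure and lowers the outcome score, and it uses the Wald ratio, a common single-variant estimator (the article does not say which estimator the selenium study used).

```python
import random


def covariance(xs, ys):
    """Sample covariance of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)


def slope(xs, ys):
    """Least-squares slope from regressing ys on xs."""
    return covariance(xs, ys) / covariance(xs, xs)


def simulate_cohort(n=20000, true_effect=0.0, seed=1):
    """Toy cohort (invented numbers): variant count, exposure, outcome."""
    rng = random.Random(seed)
    variant, exposure, outcome = [], [], []
    for _ in range(n):
        g = rng.randint(0, 1) + rng.randint(0, 1)   # 0, 1 or 2 copies of the variant
        lifestyle = rng.gauss(0, 1)                 # unmeasured confounder
        x = 0.5 * g + lifestyle + rng.gauss(0, 1)   # exposure, e.g. blood selenium
        # Healthy lifestyle lowers the outcome score; the exposure itself
        # contributes only through true_effect (zero by default).
        y = true_effect * x - lifestyle + rng.gauss(0, 1)
        variant.append(g)
        exposure.append(x)
        outcome.append(y)
    return variant, exposure, outcome


variant, exposure, outcome = simulate_cohort()

# Observational estimate: contaminated by the hidden lifestyle factor.
naive_estimate = slope(exposure, outcome)

# Wald ratio: variant-outcome slope divided by variant-exposure slope.
wald_estimate = slope(variant, outcome) / slope(variant, exposure)

print(f"naive regression estimate: {naive_estimate:.2f}")
print(f"Wald ratio estimate:       {wald_estimate:.2f}")
```

In this toy setting the naive regression reports a spurious protective effect, whereas the variant-based estimate stays close to the true effect of zero, because the variant is assigned independently of lifestyle.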

The analysis found no benefit from
selenium, just as the SELECT trial had done.


Data bounty 


Such results can feed into decisions about 
whether to launch full clinical trials, Davey 
Smith argues. And Mendelian randomiza- 
tion can test hypotheses for which it would 
be unethical or impractical to carry out a trial.

In principle, a Mendelian randomization 
analysis can be done wherever a genetic vari- 
ant can be found to naturally mimic the effects 
of an environmental exposure. And more of 
those are found every year — especially as 
millions of people around the world sign up 
to have their genomes analysed and their 
health tracked. That gives geneticists the sta- 
tistical power to identify genetic associations 
with everything from alcohol consumption to 
cholesterol levels. 

Now, epidemiologists and others are feeding
these findings into more Mendelian randomization
tests. Data from Scopus and the Web of
Science list fewer than 100 papers published
per year on the topic in 2010, growing to about
200 by 2015. In 2019, so far, more than 500
papers have used or discussed the method.
Researchers have used the tests to tackle a
number of questions typically confounded
by life’s many variables. Studies have helped
to show more definitively that drinking alcohol
can increase the risk of cancer. Meanwhile,
having low cholesterol does not, despite some
observations to the contrary (see ‘Genes as
proxy’).


GENES AS PROXY

Mendelian randomization uses gene variants to interrogate cause and effect in epidemiological data. For example,
researchers can ask whether having low cholesterol increases cancer risk (as some data suggest) by looking at
people genetically predisposed to low cholesterol. This strategy can help rule out reverse causation (that cancer
lowers cholesterol), and it can bypass some variables that might influence both cancer risk and cholesterol.
However, it relies on several assumptions, which must be tested.

Assumption 1
The link between the variants and having low cholesterol has to be strong and stable over time.

Assumption 2
The variants must not influence variables that might affect both cholesterol levels and cancer risk.

Assumption 3
The variants must not be associated with cancer risk in any way other than through the relationship to cholesterol.

As a prime example of how Mendelian 
randomization can help, many researchers 
point to myopia, which is a rapidly growing 
public-health issue. It’s been impossible to test 
its connection to schooling with a randomized
controlled trial, because it’s unethical to
deliberately keep some children out of school. 

A chance to use Mendelian randomization 
came in 2016, after geneticists published 
data from two separate GWAS: one looking 
for genetic signatures related to educational 
attainment; the other looking for genes
associated with myopia. The studies looked
at hundreds of thousands of people and found 
dozens of genetic variants robustly associated 
with myopia and years spent in school. 

The next year, epidemiologists used these 
variants to explore one of the biggest popula- 
tion data sets around — 488,000 middle-aged 
and older people who had signed up to the 
UK Biobank project. Volunteers have their 
genomes analysed and answer questions on 
dozens of personal details, including their 
education and eyesight. When the researchers 
— at the University of Cardiff and the Univer- 
sity of Bristol in the United Kingdom — used 
Mendelian randomization to analyse the 
data, they found that being genetically prone
to myopia made no difference to how many 
years people had spent at school. Those who 
carried the genes associated with educational 
attainment, however, were significantly more 
likely to be short-sighted.

Whether through time spent reading, lower 
levels of natural light or some other factor, 
time spent in school clearly influences vision, 
says Denize Atan, an ophthalmologist at Bristol 
who led the project. The link is so strong, she 
says, that policymakers and schools should do 
more to address it. 


The open secret 


The problem is, critics argue, that not all 
Mendelian randomization studies are as 
sound. “You need to have a robust hypothe- 
sis and some supporting evidence before you 
start,” Atan says. A growing number, she says, 
do not. “You think, ‘Where did they get that 
idea from?’ It just seems to come out of the 
blue.” This is a big problem, she adds, because 
Mendelian randomization allows research- 
ers to seek, find and publish relationships 
between unfamiliar data sets without any 
specialist knowledge of the relevant field. 

As Sonja Swanson, an epidemiologist at 
the Erasmus University Medical Center in 
Rotterdam, the Netherlands, puts it: “It doesn’t
take much to just hit the buttons and say ‘here’s
a numeric answer to my question’.”

Several epidemiologists say that it’s an open
secret in the field that many published Mendelian
randomization studies are problematic.
“You can get papers published very easily,” says 
Davey Smith. “Some of the very poor papers 
are from researchers who don’t understand 
epidemiological principles.” In 2016, a Mendelian
randomization study claimed to have
found that high blood levels of C-reactive
protein, which is produced by the liver and
associated with inflammation, caused
schizophrenia. It suggested
that drugs capable of lowering levels of the
protein in the blood might help to treat people
with the disease. Davey Smith’s group and
another did similar analyses, and found that,
in fact, C-reactive protein had a protective
effect against schizophrenia. Davey Smith
and his co-authors suggested that there were
issues with how the initial group had combined
genetic data sets, and the 2016 paper
was eventually retracted.

Epidemiologists have also criticized
Mendelian randomization analyses that
claimed to have found that smoking while
pregnant causes dramatic declines in baby
birthweights and substantially increases the
risk of orofacial clefts in offspring.

The problem, they say, is that the genetic 
variants used as proxies for smoking behav- 
iour were identified in what are known as 
candidate-gene studies, in which research- 
ers evaluate a few genes that they suspect 
are involved in a behaviour such as smoking. 
Results from such studies can be unreliable 
because they are biased towards finding some 
effects in the genes being examined. The vari- 
ants that the authors used in their Mendelian 
randomization haven't shown up in larger, 
more comprehensive GWAS. 

George Wehby, a health-policy researcher 
at the University of Iowa in Iowa City, who led
the smoking projects, says that the work was
done before better data were available. “I agree
that these wouldn't be the first choice,” he says, 
“given current knowledge about genetics of 
smoking from large GWAS.” 


Against common sense 


To an economist, Mendelian randomization 
looks a lot like something called instrumental 
variable analysis, in which a variable referred 
to as the instrument is used to help unpick 
hidden relationships between two other 
observations. “When we saw that epidemiol- 
ogists were using genes as instrumental vari- 
ables, we were both intrigued and said, ‘Wait a 
moment!’” says Koellinger. Such analyses are 
built on assumptions that need to be carefully 
scrutinized. 

One central assumption in Mendelian 
randomization is that the genetic variants 
must not affect the outcome in any other way. 
For example, there is a variant of the gene that 
encodes the enzyme aldehyde dehydrogenase 
(ALDH2) that disrupts metabolism of alcohol. 
When people with this variant drink, they tend 
to feel nauseous, and so it’s associated with 
lower levels of alcohol consumption. That 
might seem a plausible way to test, for exam- 
ple, whether drinking raises blood pressure, 
because those who carry the variant generally 
drink less than those who don't. 

The problem is that ALDH2 also influences 
how likely someone is to smoke, which independently
influences blood pressure. This
phenomenon, known as genetic pleiotropy, 
can invalidate Mendelian randomization 
results. And that creates a problem, because 
the extent of pleiotropy isn’t fully realized for
many genes.
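
To see why the instrumental-variable logic works, and how pleiotropy breaks it, a toy simulation can help. The sketch below is not taken from any study mentioned here: the effect sizes, the 30% carrier frequency and the function names (`simulate`, `naive_slope`, `wald_ratio`) are all invented for illustration. It uses a ratio of covariances, a standard single-instrument estimator often called the Wald ratio, and compares it with a naive regression slope that is distorted by a hidden confounder.

```python
import random


def simulate(n, pleiotropy=0.0, seed=0):
    """Simulate a gene variant G, an exposure X and an outcome Y.

    All numbers are illustrative assumptions, not estimates from real data.
    A hidden confounder U affects both X and Y. `pleiotropy` is a direct
    effect of G on Y that bypasses X, which violates the key assumption.
    """
    rng = random.Random(seed)
    true_effect = 0.5  # assumed causal effect of X on Y
    g, x, y = [], [], []
    for _ in range(n):
        gi = 1.0 if rng.random() < 0.3 else 0.0  # carries the variant or not
        u = rng.gauss(0, 1)                      # unmeasured confounder
        xi = -1.0 * gi + u + rng.gauss(0, 1)     # variant lowers the exposure
        yi = true_effect * xi + 2.0 * u + pleiotropy * gi + rng.gauss(0, 1)
        g.append(gi)
        x.append(xi)
        y.append(yi)
    return g, x, y


def cov(a, b):
    """Sample covariance of two equal-length lists."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    return sum((ai - mean_a) * (bi - mean_b) for ai, bi in zip(a, b)) / (len(a) - 1)


def naive_slope(x, y):
    """Ordinary regression slope of Y on X, which ignores the confounder."""
    return cov(x, y) / cov(x, x)


def wald_ratio(g, x, y):
    """Instrumental-variable estimate: effect of G on Y divided by effect of G on X."""
    return cov(g, y) / cov(g, x)


if __name__ == "__main__":
    g, x, y = simulate(200_000)
    print("naive slope:", round(naive_slope(x, y), 2))
    print("IV estimate, valid instrument:", round(wald_ratio(g, x, y), 2))
    g, x, y = simulate(200_000, pleiotropy=0.3)
    print("IV estimate, pleiotropic instrument:", round(wald_ratio(g, x, y), 2))
```

In this set-up the naive slope comes out near 1.4, far from the assumed true effect of 0.5, whereas the ratio estimate lands close to 0.5. Giving the variant a direct effect on the outcome (`pleiotropy=0.3`) shifts the ratio estimate to about 0.2, which illustrates how a violated assumption can bias the answer without any warning sign in the output.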

Another assumption is that a given genetic 
variant has a strong effect. As bigger and more 
powerful GWAS dredge up weaker genetic links 
to different traits, this assumption becomes 
harder to test. 

A 2015 review by epidemiologists in the 
Netherlands of 178 published Mendelian ran- 
domization studies found that fewer than half 
adequately discussed these assumptions. “As
these assumptions are crucial for the validity 
of Mendelian randomization studies, they 
should always be discussed in the specific 
context of the study,” the researchers argue.

Mendelian randomization is also subject to 
a distinct source of bias — one that’s a matter of 
life and death. People can die only once. This 
issue would complicate, for example, an anal- 
ysis of deaths from stroke. Such deaths tend 
to occur in older people, so a study of strokes
will typically recruit people who have already
survived conditions that affect younger people,
such as heart disease. Because stroke and
heart disease have common causes, such as 
high cholesterol (and therefore common ther- 
apies, including statins), this survivor bias can 
throw up some misleading results. 

To demonstrate the effects of this bias, Mary 
Schooling, a public-health epidemiologist at 
the City University of New York, ran Mendelian 
randomization tests in which gene variants 
linked to reduced cholesterol stood in for sta- 
tin use. People with this beneficial inheritance 
have fewer heart attacks early in life and live 
to an age at which the risk of stroke rises. So, 
the study concluded that cholesterol-lowering 
statins would actually cause strokes.

“It didn’t make any sense,” Schooling says. 
Proper randomized controlled trials aren’t 
confused in this way: they show that statins 
protect against stroke. But Mendelian rand- 
omization shows a survivor bias that must be 
identified and corrected for. 


Battling biases 


“Every single method can be biased,” says 
Davey Smith. Mendelian randomization, he
says, is not intended to replace randomized
controlled trials, but, alongside other sources,
including observational studies, it can
add to the evidence available to help make
an informed decision. Now, researchers are
looking for ways to improve the method.

One way is to identify and correct for some 
of the biases, and to apply statistical tools 
for testing the strength of the assumptions. 
Davey Smith points to papers that can help 
researchers to assess the quality of Mendelian 
randomization studies for themselves.

Better organization of data can help, too. 
Unbiased analyses assume that genes are ran- 
domly distributed, but some genes are known 
to cluster in geographical regions. Already,
genotype data sets are becoming available 
that are grouped by extended families, and 
Mendelian randomization studies of these
data are identifying, for example, that height 
and body mass index might not influence 
educational attainment as much as previous 
studies had suggested.

By comparing the results of within-family 
and population-based studies, geneticists 
can help to distinguish the roles of nature and 
nurture in a given trait. “Particular genes are 
correlated with particular features of the local 
environment. And if you want to use genes for 
causal inference you need to break that link,” 
says Koellinger. 

This kind of accuracy is important if 
researchers want to harness the growing tor- 
rent of genetic information for public-health 
and policy recommendations. But even these 
tools need to be improved and supplemented. 

Ware’s observations 200 years ago on the 
eyesight of students and soldiers have been 
explained through a genetic lens that no one 
could have imagined at the time. Ironically, 
it took the British army another century to 
accept recruits who need to wear glasses, and 
to change its standards for what it considers 
adequate vision. Even during the First World 
War, some authorities argued it didn’t matter 
if a British soldier couldn't see clearly what he 
was shooting at, as long as he could “fire in the
right direction”. 

Statistical tools for epidemiology are improving.
And although Mendelian randomization
does not always offer perfect clarity, it might, 
at least, point researchers in the right direction. 


David Adam is a freelance journalist based 
near London. 
IN CRISIS


A promising field of research on social behaviour 
floundered after investigators couldn't repeat key 
findings. Now researchers are trying to establish what’s 
worth saving. By Tom Chivers 
Three years ago, a team of psychologists
challenged 180 students with
a spatial puzzle. The students could 
ask for a hint if they got stuck. But 
before the test, the researchers 
introduced some subtle interven- 
tions to see whether these would 
have any effect. The psychologists 
split the volunteers into three groups, each of 
which had to unscramble some words before 
doing the puzzle. One group was the control, 
another sat next to a pile of play money and 
the third was shown scrambled sentences that 
contained words relating to money. 

The study, published this June, was a careful
repeat of a widely cited 2006 experiment. The
original had found that merely giving students 
subtle reminders of money made them work 
harder: in this case, they spent longer on the 
puzzle before asking for help. That work was 
one among scores of laboratory studies which 
argued that tiny subconscious cues can have 
drastic effects on our behaviour. 

Known by the loosely defined terms ‘social 
priming’ or ‘behavioural priming’, these stud- 
ies include reports that people primed with 
‘money’ are more selfish; that those primed
with words related to professors do better
on quizzes; and even that people exposed to
something that literally smells fishy are more
likely to be suspicious of others.

The most recent replication effort, however,
led by psychologist Doug Rohrer at the
University of South Florida in Tampa, found 
that students primed with ‘money’ behave 
no differently on the puzzle task from the 
controls. It is one of dozens of failures to 
verify earlier social-priming findings. Many 
researchers say they now see social priming 
not so much as a way to sway people’s unconscious
behaviour, but as an object lesson in
how shaky statistical methods fooled scien- 
tists into publishing irreproducible results. 

This is not the only area of research to be 
dented by science’s ‘replication crisis’. Failed 
replication attempts have cast doubt on 
findings in areas from cancer biology to eco- 
nomics. But so many findings in social priming 
have been disputed that some say the field is 
close to being entirely discredited. “I don’t 
know a replicable finding. It’s not that there 
isn’t one, but I can’t name it,” says Brian Nosek,
a psychologist at the University of Virginia in 
Charlottesville, who has led big replication 
studies. “I’ve gone from full believer to full 
sceptic,” adds Michael Inzlicht, a psychologist 
at the University of Toronto, Canada, and an 
associate editor at the journal Psychological 
Science. 

Some psychologists say the pendulum has 
swung too far against social priming. Among 
these are veterans of the field who insist that 
their findings remain valid. Others accept 
that many of the earlier studies are in doubt, 
but say there’s still value in social priming’s 
central idea. It is worth studying whether it’s
possible to affect people’s behaviour using 
subtle, low-cost interventions — as long as 
the more-outlandish and unsupported claims 
can be weeded out, says Esther Papies, a 
psychologist at the University of Glasgow, UK. 
Equipped with more-rigorous statistical 
methods, researchers are finding that 
social-priming effects do exist, but seem to 
vary between people and are smaller than first 
thought, Papies says. She and others think that 
social priming might survive as a set of more 
modest, yet more rigorous, findings. “I’m quite 
optimistic about the field,” she says. 


Rise and fall 


The roots of the priming phenomenon go back 
to the 1970s, when psychologists showed that 
people get faster at recognizing and process- 
ing words if they are primed by related ones. 
For instance, after seeing the word ‘doctor’, 
they recognized ‘nurse’ faster than they did 
unrelated words. This ‘semantic’ priming is 
now well established. 

But in the 1980s and 1990s, researchers 
argued that priming could affect attitudes and 
behaviours. Priming individuals with words 
related to ‘hostility’ made them more likely 
to judge the actions of a character in a story as
hostile, a 1979 study found. And in 1996, John
Bargh, a psychologist at New York University
in New York City, found that people primed
with words conventionally related to age in 
the United States — ‘bingo’, ‘wrinkle’, ‘Florida’ 
— walked more slowly than the control group 
as they left the lab, as if they were older.

Dozens more studies followed, finding that 
priming could affect how people performed 
at general-knowledge quizzes, how generous
they were or how hard they worked at
tasks. These behavioural examples became
known as social priming, although the term is
disputed because there is nothing obviously 
social about many of them. Others prefer 
‘behavioural priming’ or ‘automatic behaviour 
priming’. 

In his 2011 best-seller Thinking, Fast and 
Slow, Nobel-prizewinning psychologist 
Daniel Kahneman mentioned several of the 
best-known priming studies. “Disbelief is not 
an option,” he wrote of them. “The results are 
not made up, nor are they statistical flukes. You 
have no choice but to accept that the major 
conclusions of these studies are true.” 

But concerns were starting to surface. That 
same year, Daryl Bem, a social psychologist at 
Cornell University in Ithaca, New York, pub- 
lished a study suggesting that students could 
predict the future. Bem’s analysis relied on
statistical techniques that psychologists regu- 
larly used. “I remember reading it and thinking 
‘Fuck. If we can do this, we have a problem,’”
says Hans IJzerman, a social psychologist at 
the University of Grenoble Alps in Grenoble, 
France. 

Also that year, three other researchers 
published a deliberately absurd finding: that 
those who listened to the Beatles song ‘When 
I’m Sixty-Four’ literally became younger than 
a control group that listened to a different 
song. They achieved this result by analysing 
their data in many different ways, getting a statistically
significant result in one of them by
simple fluke, and then not reporting the other
attempts. Such practices, they said, were common
in psychology and allowed researchers to
find whatever they wanted, given some noisy 
data and small sample sizes. 
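
A back-of-the-envelope calculation shows how quickly this flexibility inflates false positives. If each analysis has a 5% chance of a fluke ‘significant’ result and ten analyses are tried, the chance that at least one succeeds is 1 − 0.95^10, or roughly 40%. That figure assumes the ten analyses are independent, which re-analyses of the same data usually are not, so it should be read as an illustration and not as a description of the Beatles study. The sketch below simulates the idea with invented parameters (20 people per group, ten unrelated outcome measures, no true effect anywhere); the function names are hypothetical.

```python
import random
from statistics import NormalDist


def p_value(a, b):
    """Two-sided z-test for a difference in means.

    Assumes equal group sizes and a known standard deviation of 1,
    which holds here because the data are simulated that way.
    """
    n = len(a)
    z = (sum(a) / n - sum(b) / n) / (2.0 / n) ** 0.5
    return 2.0 * (1.0 - NormalDist().cdf(abs(z)))


def false_positive_rate(n_outcomes, n_per_group=20, n_studies=2000,
                        alpha=0.05, seed=1):
    """Share of null studies that find at least one 'significant' outcome."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_studies):
        for _ in range(n_outcomes):
            # Both groups come from the same distribution: no real effect.
            primed = [rng.gauss(0, 1) for _ in range(n_per_group)]
            control = [rng.gauss(0, 1) for _ in range(n_per_group)]
            if p_value(primed, control) < alpha:
                hits += 1  # report this outcome and stay silent about the rest
                break
    return hits / n_studies


if __name__ == "__main__":
    print("one planned analysis:", false_positive_rate(1))
    print("best of ten analyses:", false_positive_rate(10))
```

With a single planned analysis the simulated false-positive rate stays near the nominal 5%; picking the best of ten pushes it to around 40%, in line with the arithmetic above.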

The papers had an explosive impact. Replication
efforts that cast doubt on key findings
started to appear, including a 2012 report that
repeated Bargh’s ageing study and found no
effect of priming unless the people observing
the experiment were told what to expect. It
did not help that this all took place as it was
discovered that a leading social psychologist
in the Netherlands, Diederik Stapel, had been
faking data for years.


WANING EFFECT


A meta-analysis of 246 experiments that exposed people to money-related stimuli found that early studies reported
larger priming effects on behaviour, emotions and attitudes than did later ones. It also revealed larger effects in
published work than in unpublished experiments provided by authors of the original studies.
*Effect size measured by a value known as Hedges’ g, where ‘1’ indicates that primed and control groups differed by 1 standard deviation.
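
The ‘Waning effect’ note expresses effect sizes as Hedges’ g. As a rough guide to what that number means, the sketch below computes it in the usual way: the difference between group means divided by the pooled standard deviation, multiplied by a small-sample correction factor of approximately 1 − 3/(4(n1 + n2) − 9). The function name and the example scores are invented for illustration and do not come from the meta-analysis.

```python
from statistics import mean, variance


def hedges_g(primed, control):
    """Standardized mean difference with a small-sample bias correction."""
    n1, n2 = len(primed), len(control)
    # Pooled standard deviation, weighting each group's sample variance
    # by its degrees of freedom.
    pooled_sd = (((n1 - 1) * variance(primed) + (n2 - 1) * variance(control))
                 / (n1 + n2 - 2)) ** 0.5
    d = (mean(primed) - mean(control)) / pooled_sd
    # Approximate correction factor; it shrinks the estimate for small samples.
    correction = 1.0 - 3.0 / (4.0 * (n1 + n2) - 9.0)
    return correction * d


if __name__ == "__main__":
    # Hypothetical minutes spent on a puzzle before asking for help.
    primed = [3, 4, 5, 6, 7]
    control = [2, 3, 4, 5, 6]
    print(round(hedges_g(primed, control), 3))
```

For these made-up scores the groups differ by one minute, the pooled standard deviation is about 1.58 minutes, and the corrected effect size is about 0.57, meaning that the primed group sits a little over half a standard deviation above the controls.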

In 2012, Kahneman wrote an open letter to 
Bargh and other “students of social priming”, 
warning that “a train wreck” was approach- 
ing. Despite being a “general believer” in the 
research, Kahneman worried that fraud such 
as Stapel’s, replication failures and a tendency
for negative results not to get published had
created “a storm of doubt” (see
go.nature.com/2jcoetz).

Seven years later, the storm has uprooted 
many of social priming’s flagship findings. 
Eric-Jan Wagenmakers, a psychologist at the 
University of Amsterdam, says that when he 
read the relevant part of Kahneman’s book, 
“I was like, ‘not one of these studies will 
replicate.’ And so far, nothing has.”

Psychologist Eugene Caruso reported in 
2013 that reminding people of the concept 
of money made them more likely to endorse 
free-market capitalism. Now at the University
of California, Los Angeles, Caruso says
that having tried bigger and more systematic 
tests of the effects, “there does not seem to 
be robust support for them”. Ap Dijksterhuis, 
a researcher at Radboud University in 
Nijmegen, the Netherlands, says that his 
paper suggesting that students primed with
the word ‘professor’ do better at quizzes “did 
not pass the test of time”. 

Kahneman told Nature: “I am not up to date
on the most recent developments, so should 
not comment.” 

Researchers had been whispering about 
not being able to repeat big findings years 
before the priming bubble began to burst, 
says Nosek. Afterwards, in lessons shared with
science’s wider replication crisis, it became 
clear that many of the problematic findings 
were probably statistical noise — fluke results 
garnered from studies on too-small groups 
of people — rather than the result of fraud. It 
seems that many researchers were not alert to
how easy it is to find significant-looking but 
spurious results in noisy data. This is especially 
so if researchers ‘HARK’ (Hypothesize After 
Results are Known) — that is, change their 
hypotheses after looking at their data. The fact 
that journals tend not to publish null results 
didn’t help, because it meant the only findings 
that got through were the surprising ones. 

There is also evidence that subconscious 
experimenter effects have been a problem, 
Papies says: one study found that when experi- 
menters were aware of the priming effect they 
were looking for, they were much more likely 
to find it, suggesting that, subconsciously, 
they would affect the results in some way.

Since then, there have been widespread
moves throughout psychology to 
improve research methods. These include 
pre-registering study methods before looking 
at data, which prevents HARKing, and working 
with larger groups of volunteers. Nosek, for 
instance, has led the Many Labs project, in 
which undergraduates at dozens of labs try 
to replicate the same psychology studies, giv- 
ing sample sizes of thousands. On average, 
about half of the papers that Many Labs looks 
at can be replicated successfully. Other col- 
laborative efforts include the Psychological 
Science Accelerator, a network of labs that 
work together to replicate influential studies. 


The new social priming 


Today, much of the work being done in social 
priming involves replications of earlier work, 
or meta-analyses of multiple papers to try to 
tease out what still holds true. A meta-analysis 
of hundreds of studies on many kinds of 
money priming, reported this April, found
little evidence for the large effects the early 
studies claimed. It also found larger effects in 
published studies than in unpublished exper- 
iments that had been shared with the authors 
of the meta-analysis (see ‘Waning effect’). 

Original work hasn’t dried up entirely, says 
Papies, although the focus is changing. Much 
of the high-profile social-priming work of 
the past was designed to find huge, universal 
effects, she says. Instead, her group’s studies 
focus on finding smaller effects in the subset of 
people who already care about the thing being 
primed. She has found that people who want 
to become thinner are more likely to make 
healthy food choices if they are primed, say, 
with words on a menu such as ‘diet’, ‘thin’ and
‘trim figure’. But it works only in people for
whom a healthy diet is a central goal; it doesn’t
make everyone avoid fattening foods.

This matches the findings of a meta-analysis 
from 2015, led by psychologist Dolores 
Albarracin at the University of Illinois at 
Urbana-Champaign. It looked at 352 prim- 
ing studies that involved presenting words 
to people, and it found evidence of real, if 
small, effects when the prime was related to a
goal that the participants cared about. That
analysis, however, deliberately looked only at 
experiments in which the priming words were 
directly related to the claimed effect, such 
as rudeness-related words leading to ruder 
behaviour or attitudes. It avoided looking at 
studies with primes that had what it termed 
‘metaphorical’ meaning — including the age- 
ing-related words that Bargh said led to slower 
walking, or the money-related priming work. 

Research into priming has declined, how- 
ever, and what is considered priming is not 
always the same as the startling claims of 
the 1990s and 2000s. “There’s a lot less than 
there was five or ten years ago,” says Antonia 
Hamilton, a neuroscientist at University 
College London, who still works on priming.
Partly, she says, that’s because of the replication
problems: “We do less since it all blew up.
It’s harder to make people believe it and there 
are other topics that are easier to study.” It 
might also be simply that the topic has become 
less fashionable, she says. 

Hamilton’s own work involves, among other 
things, putting people in functional magnetic 
resonance imaging (fMRI) scanners to see how 
priming affects brain activity. In one 2015 
study, she used a scrambled-sentence task to 
prime ‘prosocial’ ideas (such as ‘helping’) and
‘antisocial’ ones (such as ‘annoying’), seeing 
whether it made participants quicker to mimic 
other people’s actions, and whether there were 
detectable differences in brain scans.

Using fMRI is only practical with small 
numbers of volunteers, so she looks at 
how the same people respond when they 
have been primed and when they haven't: a 
‘within-subjects’ design, in contrast to the 
‘between-subjects’ design of priming studies
that use a control group. The design means
that researchers don’t have to worry about 
pre-existing differences between groups, 
Hamilton says. Her research has found priming 
effects: people primed with prosocial concepts 
behave in more prosocial ways, and fMRI scans 
did show differences in activity in brain areas 
such as the medial prefrontal cortex, which is 
involved in regulating social behaviours. But, 
she says, the effects are more modest than 
those the classic priming studies found. 
Some researchers say that however efforts to 
test older results pan out, the concept of social 
or behavioural priming still has merit. “I still 
have no doubts whatsoever that in real life, 
behaviour priming works, despite the fact that 
in the old days, we didn’t study it properly relative
to current standards,” says Dijksterhuis.
Bargh says that despite many researchers 
now discounting them, important early 
advances do exist — such as his own 2008 
study, which reported that holding warm 
coffee made people behave more warmly 
towards others. Direct replications have
failed to support the result, but Bargh says
that a link between physical warmth and social 
warmth has been demonstrated in other work, 
including neuroimaging studies. “People say 
we should just throw out all the work before 
2010, the work of people my age and older,” 
says Bargh, “and I don’t see how that’s justi- 
fied.” He and Norbert Schwarz, a psychologist 
at the University of Southern California in 
Los Angeles, say that there have been repli- 
cations of their earlier social-priming results 
— although critics counter that these were not 
direct replications but ‘conceptual’ ones, in
which researchers test a concept using related 
experimental set-ups. 

Bargh says that results of social priming 
are still widely believed and used by non-academics,
such as political campaigners and business
marketers, even when they are sceptical.
Gary Latham, for instance, an organizational 
psychologist at the University of Toronto in 
Canada, says: “I strongly disliked Bargh’s 
findings and wanted to show it doesn’t work.” 
Despite this, he says, he has for ten years con- 
sistently found that priming phone marketers 
with words related to ideas of success and 
winning increases the amount of money they 
make. But Leif Nelson, a psychologist at the
University of California, Berkeley, emphasizes 
that whether or not social-priming ideas are 
subsequently confirmed, the classic studies in 
the field were not statistically powerful enough 
to detect the things they claimed to find. 

Bargh sees positives and negatives in 
how psychology research has changed. “If 
pre-registration stops people from HARKing, 
then I guess it’s good,” he says, “but it always
struck me as an insult. ‘We don’t trust you to 
be honest’; it feels like we’re being treated like 
criminals, wearing ankle bracelets.” 

Others disagree. The move towards open, 
reproducible science, according to most 
psychologists, has been a huge success. Social
priming as a field might survive, but if it does 
not, then at least its high-profile problems 
have been crucial in forcing psychology to 
clean up its act. “I have to say I am pleasantly
surprised by how far the field has come in 
eight years,” says Wagenmakers. “It’s been a
complete change in how people do things and
interpret things.” 


Tom Chivers is a science journalist based in 
London. 
Science in culture


Books & arts 


Pepper the robot greets visitors to Berlin’s Futurium. 


Lost in the house
of tomorrow 


What kind of future does Berlin’s newest 
museum really show us? By Stephen Cave 
Thirty years ago, the future became
passé. When the Berlin Wall fell in 
late 1989 and the communist regimes 
that hid behind it collapsed, political 
scientist Francis Fukuyama called the 
event “the end of history”. But he also cast it 
as the finale of the future: the end of imagin- 
ing how things might be different. The uto- 
pian visions driving both communism and 
fascism had been discredited and defeated. 
They were to be replaced by an eternal ‘now’ 
that, in Fukuyama’s words, saw “Western lib- 
eral democracy as the final form of human 
government”. 

In the intervening decades, past and 
future have been reasserting themselves. 
Throwbacks such as right-wing populism are 
challenging liberal complacency, and environ- 
mental apocalypse is emerging as the face of 
the future. But most people in high-income 
countries would rather avert their eyes to 
focus on a still-comfortable consumerist 
present. At such a time, imagining a positive
future is a challenge, but a new initiative in 
Berlin wants to try. 

The Futurium, billed as a “house of futures”, 
is a museum, exhibition space and site for
education and conferences. The €58-million 
(US$64-million) building, designed by Berlin 
architects Richter Musikowski, shimmers in 
asymmetrical glass and metal, like a 1960s 
vision of the space age squashed between 
office blocks on the banks of the River Spree. 
It opened in September in the presence of Anja 
Karliczek, head of Germany’s Federal Minis- 
try for Education and Research, the museum’s 
largest funder. Other backers include leading 
scientific institutes such as the Fraunhofer 
Society, Helmholtz Association and Max 
Planck Society. 

How do these prestigious bodies invite us 
to imagine the future? Oddly timidly, in my 
view — with a focus on new technologies and 
modest hints for living sustainably. The open- 
ing exhibition reveals deeper issues with the 
thinking behind the space. 

The Futurium’s three floors are divided into 
a basement ‘lab’ for workshops, a ground-floor
‘forum’ for larger events and an exhibition 
space above. The last is divided into three 
‘thinking spaces’: Human, Nature and Tech- 
nology, each with its own exhibition — respec- 
tively, Common Cause, Rethinking Nature and 
Towards New Horizons. A humanoid robot 
(‘Pepper’, made by SoftBank Robotics in Paris) 
acts as greeter, dispensing wristbands that 
contain implanted radio-frequency identification
chips. At various points in the show, visitors
can vote for a possible future by pressing
their wristband against a scanner. For example,
they can choose whether they would prefer to
live on after death as a digital avatar, pass their
data to their loved ones or have all electronic
traces of themselves deleted.

Philip Beesley’s cell-simulating 2018 sculpture Noosphere on display at the Futurium.

Past Pepper, the layout is baffling, with little
indication of what is where, or why. When I
stumbled across the Technology thinking 
space, however, I found the exhibits inform- 
ative, with concepts such as big data and arti- 
ficial intelligence explained well. Visitors of 
all ages are kept engaged by various gadgets, 
such as tablet computers that provide an 
augmented view of some displays. 


Connected realms 


The Human space invites us to consider 
topics such as the networked society and 
future cities. A central focus is overconsump- 
tion, with a rather messy series of exhibits 
prompting visitors to consider how much 
stuff they really need, and promoting the 
benefits of decluttering. In the Nature area, 
themes include urban farming and renewable 
energy. An exhibit on sustainable biomateri- 
als presents some provocative alternatives 
to plastics — shoes made from algae, and 
fungus-based furniture. The curators intend
to update the whole exhibition gradually,
and to introduce major new themes annually 
from 2021. These individual elements are 
stimulating. 

But I found the conceptual basis for the 
Futurium’s structure — the division into 
human, technology and nature — problematic.
An installation at the start of the exhibition
displays the three concepts as linked but
distinct, with ‘technology’ far removed from
‘people’, and ‘natural’ nodes such as ecosystems
or even the laws of nature on a par with
art and faith. These separations are deeply 
misleading. To portray technology as uncon- 
nected to humans fuels technological deter- 
minism; to portray it as separate from nature 
is to perpetuate the Western myth that tech- 
nology exists to dominate an unruly Earth. 
Prising the human away from nature is 
perhaps the most egregious of all; it implies 
that we are self-contained, rather than dependent
on a biosphere increasingly under threat.
Other framings, such as humanity as a sub- 
set of nature, and technology as a subset of 
the human, might have been better suited to 
the messages about sustainability that the 
museum ultimately wants to promote. 

Overall, the Futurium succeeds best as 
a showcase for the shiniest aspects of the 
present. In this way, it resembles other 
tech-engagement centres, such as Science Gallery
Dublin and its six sister venues around the
world, or Tokyo’s National Museum of Emerg- 
ing Science and Innovation. But it claims to 
be something more: a place for co-imagining 
alternative futures. To succeed, it will need to 
be bolder. Even though the Berlin landscape is 
dotted with monuments to failed ideologies, 
such as the Stasi Museum, history did not end
when the wall fell. To imagine new futures, this 
museum must free itself from the conceptual 
frameworks of the past. 


Stephen Cave is executive director of
the Leverhulme Centre for the Future of
Intelligence at the University of Cambridge,
UK.

e-mail: sjc53@cam.ac.uk 
CRISPR: the movie


A documentary showcases biology’s hottest tool — up 
to the point when things went awry. By Amy Maxmen 


At the start of Human Nature, a documentary
about the gene-editing tool
CRISPR, we meet a young man with
sickle-cell anaemia. David Sanchez is
wise beyond his years, driving home
the injustice of his gruelling blood infusions
and shortened lifespan. Researchers are testing
a therapy for his condition in clinical trials
using CRISPR.

This is a film probing the unknown future 
of a technology that, in the past decade, has
skyrocketed from obscurity to become the 
subject of a Netflix series called Unnatural 
Selection, which debuted on 18 October (the 
trailer promises provocation by leading with 
biohackers injecting the editing tool). Human 
Nature does not take a shock-horror approach.
This is the film that scientists would probably 
prefer the public to see. 

The project began with a meeting 
between the Wonder Collaborative, a sci- 
entific-documentary organization based 
in San Francisco, California, and CRISPR 
co-discoverer Jennifer Doudna, a biochemist at 
the University of California, Berkeley, and her 
colleagues. The scientists guided the film-mak- 
ers, led by a team of co-producers (including 
former cell biologist Sarah Goodwin and 
journalists Dan Rather and Elliot Kirschner) 
and director Adam Bolt, on the scientific and 
ethical issues. And the film-makers read up 
on the technology themselves. When the film
was nearly complete, they sought feedback 
from members of the US National Academies 
of Sciences, Engineering, and Medicine. 

As a result, the film is high on the thrill and 
potential of discovery, and every scientist and 
bioethicist featured is passionate and thought- 
ful. Notably, the documentary sidesteps aca- 
demic politics, such as the ongoing patent 
battle over who will reap CRISPR’s financial 
rewards. And it lacks input from scientists outside
the United States and western Europe —
such as from China, where the first human 
embryos have been edited — or from policy- 
makers with the power to restrict what research 
is permissible. However, it does feature a hand- 
ful of people who have genetic disorders, as 
well as parents of children with these maladies.


Straight to the point 

Human Nature follows a straight path, begin- 
ning with the seeds of genetic engineering 
in the 1960s and ending in the ethical quag- 
mires of making changes to human DNA that 
Human Nature
Director: Adam Bolt (2019) 
https://wondercollaborative.org/ 


future generations could inherit. The film will 
be educational for people who haven't heard 
of CRISPR before; little will be new for those 
who have. Having covered the story from the 
beginning, I enjoyed watching the who’s who 
of CRISPR’s early years speak for themselves. 
And the technology itself is covered beau- 
tifully. The camera swoops over crystalline 
salt pools in Spain, where Francisco Mojica, 
a microbiologist at the University of Alicante, 
ponders the repeated genetic letters in 
bacterial genomes — and the ‘spacers’ of uni- 
dentified DNA sequences in between. In 2005, 
he revealed that these enigmatic spacers mimic 
genes from viruses that once infected the 
microbes, and that these sequences form a kind
of ‘memory’ that allows bacteria to recognize 
and attack the invaders in future. 
Animations of double helices clarify what 
happens; they reappear when Doudna’s 
team adapts CRISPR as a laboratory work- 
horse. Even this scene has a pulse, owing to 
the ever-enthusiastic Fyodor Urnov, science 
director at the Innovative Genomics Institute, 
Berkeley's centre focused on CRISPR. Together 
with the enzyme Cas9, CRISPR spots and snips
genes that it’s been programmed to find. 
Feng Zhang at the Broad Institute of MIT and 
Harvard in Cambridge, Massachusetts, appears 
briefly. But scant attention is devoted to his 
groundbreaking studies from 2013, which
enabled CRISPR to edit animal DNA (this is
the work at the centre of the Broad’s contested 
patents). And although a montage of headlines 
shows a subsequent flurry of experiments in 
which scientists edit various organisms, little 
is said about how these studies have pushed 
forward fields ranging from evolutionary 
biology to agriculture. Nor do we learn about 
scientists’ continuing struggle to use CRISPR 
to insert genes in many organisms other than 
mice and flies. Their attempts often fail, or 
result in side effects or death. 

The film-makers probably feared that audiences
would grow weary of details. But I was
hungry for an update on what scientists are 
actually doing right now. That’s especially true 
when it comes to editing people. But rather 
than dwell on a couple of dozen CRISPR-based
therapies in the early stages of testing, the film 
bounds into the possibilities of engineering 
humans using CRISPR with an ominous clip of 
Russian President Vladimir Putin speaking in 
2017. Soldiers, he says, could be endowed with 
the ability to fight without pain. And Urnov 
explains how: CRISPR could be used to delete a
gene that transmits pain signals to the brain. He 
says he’s sure that this will become an analgesic 
treatment offered to cancer patients in pain. 


Ethical dilemma 

The rest of the film centres on the ethics of 
human editing, as if everything were possible. 
George Daley, a stem-cell biologist at Harvard
Medical School in Boston, Massachusetts,
draws a line between editing embryos and editing
adults; in the former case, children would
be born with the alteration in all their cells, 
and changes would be passed down through 
generations. 

All the scientists (the protagonists of this
film) are conscientious. Doudna, for example,
says that Adolf Hitler showed up in one of the
nightmares she had that were triggered by her
anxiety over the potential misuse of the tool.
In 2015, Urnov and his colleagues argued that
editing embryos, sperm and eggs should be
banned for now (see E. Lanphier et al. Nature
519, 410-411; 2015). And earlier this year, Zhang
and others recommended that scientists
come up with a framework that governments
could use to evaluate research proposals as
the science of gene editing progresses (see E.
Lander et al. Nature 567, 165-168; 2019).

David Sanchez has sickle-cell anaemia, which could one day be addressed using gene editing.

Still, other scientists can’t contain their 
excitement. Stephen Hsu, a co-founder 
of Genomic Prediction, a genetic-testing 
company for in vitro fertilization in North 
Brunswick, New Jersey, suggests that, 
eventually, editing could advance humanity 
by making people healthier, longer-lived or 
more intelligent. The documentary pushes him 
on this point, and flashes a Nazi propaganda 
clip. Hsu counters that his vision is different 
from eugenics because the choice to edit is 
made by parents. 


Troubling development 


Alta Charo, a bioethicist at the University of
Wisconsin-Madison, also dismisses certain
fears, pointing out, for example, that characteristics
such as intelligence are controlled by
multiple genes and by the environment. But
she concedes that there is a risk to editing, and
therefore it shouldn’t be used frivolously. Just
30 out of 195 countries have banned the editing
of human embryos, sperm and eggs in the clinic
with CRISPR, and the rules might not govern
pure research.

Human Nature traces CRISPR up to a pivotal
moment. The film was nearly finished when the 
news was reported last November that twin 
girls had been born after Chinese biophysicist 
He Jiankui had used CRISPR to edit their 
embryos. So the film-makers just spliced this 
development in as an afterthought. After the 
heroes of the film had spent so much time 
expounding on the need to prevent this out- 
come, its sudden fruition is troubling. 

Asked why the film-makers didn’t revise the 
documentary to focus on the case, Kirschner 
says that they decided there was value in what 
they had: a film on CRISPR’s origins. Plus, we 
truly don’t know what will happen next. Kirsch- 
ner writes: “It is impossible to tell whether it 
will ultimately be seen as an inflection point 
or an aberration.” 

The Wonder Collaborative had considered 
creating just a brief CRISPR explainer. I’m glad 
the group opted for a full-length feature: it 
gives them time to strike a nerve. For me, this 
happened in the scenes with Sanchez. At the 
end of the film, after so many researchers have 
gushed about the power of CRISPR to cure dis- 
ease, the interviewer asks Sanchez if he wished 
his parents had used the tool to prevent his 
being born with a deadly condition. Sanchez 
pauses, and says no: “I don’t think I'd be me.” 


Amy Maxmen writes for Nature from 
Oakland, California. 


Books in brief 




Heaven on Earth 

J. S. Fauber Pegasus (2019) 

Four towering sixteenth-century scientists — Nicolaus Copernicus, 
Tycho Brahe, Johannes Kepler and Galileo Galilei — discovered 
heliocentrism at a time of sociopolitical tumult. As J. S. Fauber drives 
home in this dynamic science history, their intermeshed stories form 
a mighty “intergenerational epic” sweeping in the likes of Brahe’s 
sister Sophie and Galileo's daughter Virginia. A wonderfully wrought 
explication of how a powerful thesis began its journey to becoming 
unavoidable fact, and seeded modernity in the process. 




Volcanoes and Wine 

Charles Frankel Chicago University Press (2019) 

For this intriguing exploration of volcanism and viniculture, Charles
Frankel scoured geologically active regions to trace how soils and
landforms shape local wines. He begins with the 1620 BC eruption
on Santorini that left the Greek island little more than a caldera,
yet created ideal conditions for growing Assyrtiko grapes, used in
unctuous Vinsanto. No less gripping are Frankel’s descriptions of
the deep-time lava flows and flooding that formed Oregon's Pinot-growing
Willamette Valley. A gem for geologists and wine buffs alike.




Antimony, Gold and Jupiter’s Wolf 

Peter Wothers Oxford University Press (2019) 

Hydrogen, caesium, silver: how were elements named? In this 
stimulating chemical chronicle, Peter Wothers unravels tangled 
etymologies. Eighteenth-century chemist Antoine Lavoisier, for 
instance, named oxygen to signify ‘acid-former’, only to have the 
word construed as “the son of a vinegar merchant”. W, the symbol 
for tungsten, is a nod to its traditional moniker wolfram (derived from 
‘wolf-foam’). From copper to californium, we discover how the sober 
face of the periodic table hides dramatic backstories. 




Nano Comes to Life 

Sonia Contera Princeton University Press (2019) 

Nanotechnology researcher Sonia Contera’s succinct study surveys 
the progress of nano-tools in biological and medical research. As
she relates, there is much in development: DNA technology aimed at
crafting nanoscale machines to target specific cancer cells; nano- 
antibiotics for fighting infection; and nano-approaches to tissue 
engineering. Contera frames this near-future transmaterial science, 
with its focus on human well-being, as an effort allied to social justice 
even as it probes existential questions of what it means to be human. 




Eureka! Details to Follow 

Sidney Harris Science Cartoons Plus (2019) 

After a year of bouts at the bench (and blasts of bad news), you may
need relief. Science cartoonist Sidney Harris, whose work has
graced the pages of many journals, including Nature, offers an ace
antidote in this irreverent look at chemistry. Here is Lewis Carroll’s
Alice, thwarted by a looking-glass made of Lexan polycarbonate
resin; two chemists absorbing the news that “you can’t both be
the ‘father’ of ammonium pentoxide phosphate”; the Institute for
Advanced Hindsight; and oodles more. Barbara Kiser
Obituary


Margarita Salas 


(1938-2019) 


Biochemist whose discoveries led to faster, more-accurate DNA testing. 


Margarita Salas discovered a new mechanism
for the replication of DNA. The enzyme she
isolated as the key to it has transformed the
process of amplifying DNA from
very small samples, and is now widely used in
forensics, studies of ancient DNA and oncology,
as well as in basic research. Her invention,
the most profitable patent ever filed by the
Spanish National Research Council (CSIC),
was recognized earlier this year by a lifetime
achievement award from the European Patent
Office.

Salas, who died on 7 November, aged 80, 
was central in bringing modern molecular 
biology to Spain. Coming of age during the 
regime of General Francisco Franco, who 
denied women an equal role in the workplace,
she blazed a path for female scientists in her
country. She was one of the first winners of the
L’Oréal-UNESCO awards for women in science,
and in 2008, King Juan Carlos I awarded her the
hereditary title of marquise, in recognition of 
her impact on Spanish society. 

Salas was born in Canero, a small village in 
the Asturias region in the north of Spain, in 
1938, in the middle of the Spanish Civil War. 
Her mother was a schoolteacher; her father, 
a physician, influenced her interest in sci- 
ence. In 1955, she went to Madrid University 
to study chemistry. In 1958, on a visit home,
she had an inspirational meeting with a dis- 
tant relative — the biochemist Severo Ochoa, 
who was awarded a Nobel prize the following 
year. Fascinated by the emerging discoveries in
biochemistry, Salas did a PhD on yeast metabolism
in the laboratory of enzymologist Alberto
Sols in Madrid. There she met and married the
biochemist Eladio Viñuela.

In 1964, Salas and her husband were 
accepted as postdoctoral fellows in Ochoa’s 
lab at New York University. The lab was deci- 
phering the mechanisms that transfer genetic 
information from DNA to protein — replica- 
tion, transcription and translation. Salas 
contributed two key pieces of information. 
First, she found that these mechanisms read 
the linear DNA message in only one direction. 
Second, she helped to show that the triplet of 
RNA nucleotides UAA represents a stop codon,
where protein synthesis ends. 

Returning to Spain in 1967, Salas and Viñuela
set up a lab at the CSIC’s Centre for Biologi- 
cal Research in Madrid. They carried out a 
systematic genetic analysis of a bacterial 
virus, the phage Φ29. Salas discovered that
the linear, double-stranded DNA of the virus 
had a specific protein attached at one end of 
each strand. She found that this protein acts 
as a starting point when the viral genome
replicates. The mechanism also operates in 
some mammalian viruses, including those 
that infect humans, such as adenovirus and 
hepatitis B virus. 




In 1977, Salas made her last academic 
move — to the recently founded Severo Ochoa 
Centre for Molecular Biology (CBMSO). The 
centre brought together most of the leading 
research groups in molecular biology in Spain 
in a joint institution of the CSIC and the Autonomous
University of Madrid. Here, with her
colleague Luis Blanco, Salas isolated the DNA
polymerase enzyme from Φ29 that went on to
have such a profound impact on DNA analysis.

Existing methods for DNA amplification, 
using the polymerase chain reaction (PCR), 
were slow and prone to errors. Using Φ29 DNA
polymerase proved to be much more efficient:
the technique was called multiple displacement
amplification. It was highly accurate,
able to generate very large DNA fragments,
and it worked at a constant temperature in the
lab, avoiding the need for shifts in temperature
as in PCR. Requiring only very small samples 
to generate many copies of whole genomes, 
it proved ideal for forensic analysis, for the 
identification of mutations in tumours and 
for genetic analysis of traces of DNA from 
ancient bones. 

Salas and Blanco patented the enzyme, 
and its commercial licensing has provided 
an important source of income for the 
CSIC. Devoted to basic research, Salas was 
also delighted that her work had led to this 
important application. She was director of the 
CBMSO from 1992 to 1994, and continued to 
work enthusiastically at experiments in her lab 
until a few weeks before her death. 

Salas was a courageous role model. She 
was the first woman to chair a big biomedical 
research centre, to be president of the Span- 
ish Biochemistry and Molecular Biology 
Society, and the first to be elected to several 
academies. 

After receiving awards or when celebrating 
an anniversary, Margarita, who was a shy person
with an austere lifestyle, would invite her
many current and former students to dinner at
a smart restaurant. Here, she would talk informally,
about topics unrelated to science, from
Bach’s music to modern Spanish literature, 
revealing her wide interest in culture. 

Margarita conveyed her passion for 
scientific research and the thrill of discovery to 
her students, fostering their motivation, cre- 
ativity, rigour and perseverance. She created 
a true school, teaching molecular biologists 
from different places and of different origins 
how to conduct — and take joy in — research. 
Many of her trainees are now research leaders. 

For generations of Spanish scientists, she 
was a guiding light. We will miss her, her 
mentorship and her leadership. 


Jesus Avila, Federico Mayor Jr and Lourdes 
Ruiz-Desviat are at the Severo Ochoa Centre 
for Molecular Biology, Madrid, Spain. J.A. 
was Margarita Salas’ first PhD student when 
she returned to Spain, and is currently the 
director of CIBERNED, Spain. F.M.Jr was a 
former director of the Severo Ochoa Centre 
for Molecular Biology and L.R.-D. is the current 
director. 

e-mails: jesus.avila@csic.es; fmayor@cbm.csic.es;
lruiz@cbm.csic.es
Setting the agenda in research


Comment 


Predatory journals: no 
definition, no defence 


Agnes Grudniewicz, David Moher, Kelly D. Cobey and 32 co-authors 


Leading scholars and 
publishers from ten 
countries have agreed a 
definition of predatory 
publishing that can protect 
scholarship. It took 12 hours 
of discussion, 18 questions 
and 3 rounds to reach. 
When ‘Jane’ turned to alternative
medicine, she had already 
exhausted radiotherapy, chemo- 
therapy and other standard 
treatments for breast cancer. Her 
alternative-medicine practitioner shared an 
article about a therapy involving vitamin infu- 
sions. To her and her practitioner, it seemed 
to be authentic grounds for hope. But when 
Jane showed the article to her son-in-law (one 
of the authors of this Comment), he realized it 
came from a predatory journal — meaning its 
promise was doubtful and its validity unlikely
to have been vetted. 

Predatory journals are a global threat. They 
accept articles for publication — along with 
authors’ fees — without performing promised 
quality checks for issues such as plagiarism or 
ethical approval. Naive readers are not the only 
victims. Many researchers have been duped 
into submitting to predatory journals, in which 
their work can be overlooked. One study that 
focused on 46,000 researchers based in Italy 
found that about 5% of them published in such 
outlets. A separate analysis suggests predatory
publishers collect millions of dollars in
publication fees that are ultimately paid out 
by funders such as the US National Institutes 
of Health (NIH). 

One barrier to combating predatory pub- 
lishing is, in our view, the lack of an agreed 
definition. By analogy, consider the historical 
criteria for deciding whether an abnormal 
bulge in the aorta, the largest artery in the body,
could be deemed an aneurysm — a dangerous 
condition. One accepted definition was based
on population norms, another on the size of 
the bulge relative to the aorta and a third on an
absolute measure of aorta width. Prevalence 
varied fourfold depending on the definition 
used. This complicated efforts to assess risk and 
interventions, and created uncertainty about 
who should be offered a high-risk operation.

Everyone agrees that predatory publishers 
sow confusion, promote shoddy scholarship 
and waste resources. What is needed is consen- 
sus on a definition of predatory journals. This 
would provide a reference point for research 
into their prevalence and influence, and would 
help in crafting coherent interventions. 

To hammer out such a consensus and to
map solutions, we and others met in Ottawa, 
Canada, over two days in April this year. The 
43 participants hailed from 10 countries and 
represented publishing societies, research 
funders, researchers, policymakers, academic 
institutions, libraries and patient partners 
(that is, patients and caregivers who proac- 
tively engage in research). Our focus was the 
biomedical sciences, but our recommenda- 
tions should apply broadly. 

Here we put forward our definition. We 
describe what it took to achieve consensus and 
how we'll move forward. 


The definition 


The consensus definition reached was: 
“Predatory journals and publishers are entities 
that prioritize self-interest at the expense of 
scholarship and are characterized by false or 
misleading information, deviation from best 
editorial and publication practices, a lack of 
transparency, and/or the use of aggressive and 
indiscriminate solicitation practices.” 


Road to consensus 


Since the term ‘predatory publishers’ was 
coined in 2010, hundreds of scholarly articles, 
including 38 research papers, have been written 
warning about them. Scientific societies and 
publishers (including Springer Nature) have 
helped to establish the ‘Think. Check. Submit.’
campaign to guide authors. But it is not enough.
More than 90 checklists exist to help identify 
predatory journals using characteristics such 
as sloppy presentation or titles that include 
words such as ‘international’. This is an over- 
whelming number for authors. Only three 
of the lists were developed using research 
evidence. Paywalled lists of quality journals
and predatory journals show that there is an 
appetite for clear, authoritative guidance. But 
these lists are inconsistent and sometimes 
out of reach (see ‘No list to rule them all’).


NO LIST TO RULE THEM ALL

Assessments of which journals are likely to be predatory or legitimate do not tally,
and titles can appear in both categories. There is no way to know which journals
were considered for a list but left off, or which were not considered.

Figure: overlapping lists of suspected predatory journals (Beall's list*) and legitimate journals (Cabells ‘verified’†, DOAJ‡). Counts shown in the figure include ‘1135 journals’ and ‘11,306’.

Beall's list highlighted the issue of predatory journals, but faced criticism over
transparency and legal threats from listed titles. It ceased operation in 2017.

Some journals deemed legitimate by the DOAJ were deemed predatory
by Beall’s and/or Cabells lists.

The DOAJ relies mainly on information from publishers. It regularly
purges titles that do not meet quality criteria.

*Informally assessed by University of Colorado Denver librarian Jeffrey Beall in ~2008-17; †Pay-to-access lists from Cabells, a scholarly analytics company; ‡The
Directory of Open Access Journals, a community-curated list requiring journal best practices such as peer review and statements on author fees and licensing.


A journal’s membership of agencies such as 
COPE (the Committee on Publication Ethics), 
curated indexes such as Web of Science, or 
being listed in the Directory of Open Access 
Journals (DOAJ) is insufficient to guarantee
quality. Predatory journals have found ways
to penetrate these lists, and new journals have
to publish for at least a year before they can 
apply for indexing. 

A scoping review comparing publications 
about predatory journals found that their 
characterizations sometimes overlapped, 
sometimes did not and sometimes directly 
conflicted. These inconsistencies suggest
that crafting a practical definition would 
require building consensus across researchers, 
publishers, research institutions and the 
broader public. 

Participants in our summit completed a 
three-round modified Delphi survey (a struc- 
tured technique to elicit input, offer feedback 
and build consensus) that included 18 ques- 
tions and 28 sub-questions. There were also 
12 hours of discussion, followed by 2 further 
rounds of feedback and revision. 

Crafting a consensus definition was hard. 
Even reaching agreement on the use of ‘preda- 
tory’ was a challenge. Part of the group wanted 
a term that acknowledges that some authors
turn to these outlets fully aware of their low
quality; these scholars willingly pay to publish
in predatory journals to add a line to their
CVs. We discussed replacing the term entirely
with language that recognizes nuances in 
publishers’ quality and motivation. Alterna- 
tives considered included ‘dark’, ‘deceptive’, 
‘illegitimate’ and ‘acting in bad faith’. Ulti- 
mately, we concluded that the term ‘preda- 
tory’ has become recognized in the scholarly 
community. Implementation science suggests 
that introducing new nomenclature would take 
considerable resources, which we felt could be 
better put towards combating predatory publishing
directly. So we recommend keeping the
word ‘predatory’ while noting its limitations. 


Details matter 


Predatory journals are driven by self-interest, 
usually financial, at the expense of scholarship. 
They are characterized by the following: 


False or misleading information. This applies 
to how the publisher presents itself. A predatory
journal’s website or e-mails often present
contradictory statements, fake impact factors, 
incorrect addresses, misrepresentations of 
the editorial board, false claims of indexing or 
membership of associations and misleading 
claims about the rigour of peer review. 


Deviation from best editorial and publica- 
tion practices. Standards here have been set 
out in the joint statement on Principles of 
Transparency and Best Practice in Scholarly 
Publishing (see go.nature.com/35mq7m)j), 
issued by the DOAJ, the Open Access Scholarly 
Publishers Association, COPE and the World 
Association of Medical Editors. Examples 
of substandard practice include not having 
a retraction policy, requesting a transfer of 
copyright when publishing an open-access 
article and not specifying a Creative Com- 
mons licence in an open-access journal. These 
characteristics can be difficult to know before 
submitting, although such information is 
easily obtained from legitimate journals. 
An unprofessional-looking web page — with 
spelling or grammar mistakes or irrelevant 
text — should also raise red flags. 

Warning signs should be assessed with 
care. For instance, journals are not eligible 
for listing on the DOAJ or joining COPE until 
after one year of operation. A well-meaning 
but poorly resourced journal might not be 
able to maintain a professional website. Also, 
some journals claim to follow best practice but 
do not. Summit participants agreed that the 
burden of proof rests on the journal. 


Lack of transparency. There are two reasons 
we list this separately from deviation from best
practice. First, transparency in operational 
procedures (such as how editorial decisions 
are made, fees applied and peer review organ- 
ized) is presently somewhat aspirational 
in academic publishing and thus cannot be 
considered a current best practice. Second, 
the absence of transparency in predatory jour- 
nals makes it important enough to highlight 
separately. Predatory publishers often fail to 
provide their contact information or details 
about article processing charges. Editors and 
members of their editorial boards are often 
unverifiable. 


Aggressive, indiscriminate solicitation. 
Although legitimate journals might solicit 
submissions, predatory journals often use 
aggressive solicitation such as repeated 
e-mails. These might be excessively flattering 
in tone, or might mention researchers’ past 
publications while noting that related submis- 
sions are urgently needed for a forthcoming 
issue. A clear warning sign is that the invitee’s 
expertise is outside the journal’s scope. 


Criteria we left out. Some obvious candi- 
dates for this list — journal quality and intent 
to deceive — were deliberately left out. It can 
be tough to distinguish a predatory journal 
from a journal that is under-resourced. Both
can be low quality, but the latter does not have
an intention to deceive (see also
go.nature.com/33gmjut and go.nature.com/2afaka7).
Furthermore, such intent is hard to assess and,
if many of the characteristics described in the
definition are met, identifying intent might 
not be necessary. 

Most controversially, we omitted quality 
of peer review, even though negligent peer
review is often a prominent feature of pred- 
atory journals. We are not saying that peer 
review is unimportant, only that it is cur- 
rently impossible to assess. Unfortunately, 
many legitimate journals fail to make their 
peer-review processes sufficiently transpar- 
ent, for instance by sharing peer reviewers’ 
comments and other data. At the moment, 
journal quality, adequacy of peer review and 
deceit are too subjective to include. 


Next steps 


Efforts to fight predatory publishing require 
collaboration and support. Organizations, 
researchers and governments have started to 
respond. To name just a few, in 2017, the NIH 
released a statement encouraging research- 
ers it funds to publish in reputable journals. 
India’s University Grants Commission has cre- 
ated a reference list of respectable journals 
and is currently working to revise academic 
publication incentives and develop a training
course to reinforce the message. In November 
2018, COPE held a forum on predatory pub- 
lishing to examine problems and solutions. 
So far, disparate attempts to address pred- 
atory publishing have been unable to control 
this ever-multiplying problem. The need will 
be greater as authors adjust to Plan S and
other similar mandates, which will require
researchers to publish their work in open-access
journals or platforms if they are funded by most 
European agencies, the World Health Organiza- 
tion, the Bill & Melinda Gates Foundation and 
others (see www.coalition-s.org). 

Many might argue that, with predatory 
journals adapting so quickly, our group’s 
efforts would have been better spent craft- 
ing interventions or promoting outreach. We 
believe that with this consensus definition, 
we are better prepared to track the problem 
over time, compare the results of studies on 
predatory journals and develop and evaluate 
intervention strategies such as educational 
campaigns and policy mandates. Over the com- 
ing months, we will solicit input and make the 
definition usable so that funders and academic 
institutions can ensure that researchers avoid 
submitting manuscripts to predatory journals 
or listing such publications on their CVs. 

Our first step is to develop a portal that 
presents our definition and other educational 
resources in multiple languages — available 
at https://osf.io/8xvpm — and how to get 
involved. Next, we will establish an interna- 
tional observatory to compile data on the 
problem, tracking numbers of publications
in predatory journals by discipline and geog- 
raphy. We will work with funders, institutions, 
patients and other stakeholders to iteratively 
develop resources to assess journal quality. We 
are seeking funding to create and test a digital 
tool to achieve these goals. 

Efforts to counter predatory publishing 
need to be constant and adaptable. The threat 
is unlikely to disappear as long as universities 
use how many publications a scholar has pro- 
duced as a criterion for graduation or career 
advancement. The publish-or-perish culture, a 
lack of awareness of predatory publishing and 
difficulty in discerning legitimate from illegitimate
publications foster an environment
for predatory publications to exist. Predatory 
journals are also quick to adapt to policies and 
measures designed to foil them. As scientific 
publishers experiment with new formats and
business models online, it has become increas- 
ingly easy for fake publishers to masquerade 
as legitimate ones. We invite others to join us 
in our call to action. 
Readers respond


Correspondence 


Groundwater: 
a call to action


As we embark on the United 
Nations ‘decade of action’ (see 
go.nature.com/2opvyi3), and as 
this week’s UN COP25 Climate 
Change Conference concludes 
in Madrid, let’s remember
the crucial contribution
of groundwater to climate
resilience and sustainable 
development. 

Besides sustaining drinking 
water and ecosystems 
worldwide, groundwater acts as 
a subsurface sponge for floods. 
It is a resource against drought
and for natural climate solutions 
that sequester soil carbon. 

And it is crucial for sustainable 
development because it enables 
food security and lifts rural 
populations out of poverty. 

However, these essential 
benefits are being undermined 
by the long-term depletion, 
contamination and salinization 
of groundwater (see, for 
example, I. E. M. de Graaf et al. 
Nature 574, 90-94; 2019). 

In our view, groundwater 
needs to be monitored and 
managed with greater rigour 
on regional and global scales 
so that it can be used more 
effectively to boost climate 
adaptation and sustainable 
development. As members of a 
global group of scientists and 
practitioners, we have issued a 
call to action to international 
and national governmental and 
non-governmental agencies, 
development organizations, 
corporations, decision makers 
and scientists, to ensure that 
groundwater benefits society 
now and into the future (see 
go.nature.com/37gnbtb). 


Evaluating Italy’s 
ranking boom 


The president and vice-president 
of the Italian National Agency 
for the Evaluation of Universities 
and Research Institutes 
(ANVUR) claim that Italy’s
rise in international research-
impact rankings is a real effect
(P. Miccoli and R. I. Rumiati
Nature 574, 486; 2019), and not
(as we have argued) the result
of Italian scholars citing one
another’s articles more heavily
(see Nature http://doi.org/dcgj;
2019). We question their
evidence for this claim. 

First, they say that scientific 
productivity in Italy has risen 
in the past decade, possibly
stimulated by the introduction of 
performance-related university 
funding. More articles are 
indeed being published, but 
the yearly growth rate of Italy’s 
scientific production has in 
fact slowed down since the 
introduction of performance- 
related targets in 2012, according 
to ANVUR’s own statistics (see 
go.nature.com/34ms9n; in 
Italian). 

Second, they state that 
ANVUR recognizes the 
importance of correcting 
gaming behaviours, including 
self-citation. They point out 
that, in an evaluation of 2011-14 
work, the agency established 
a criterion for ‘downgrading’
papers in which self-citation 
exceeded a given threshold. 
ANVUR’s own reports, however, 
show that this downgrading was 
never applied (see
go.nature.com/2jn2si; in Italian).

In our view, ANVUR’s claim 
needs to be grounded more in 
fact and less in aspiration. 


Tom Gleeson’ University of 
We'll take ‘quantum
advantage’ 


We take issue with the use of 
‘supremacy’ when referring to 
quantum computers that can 
out-calculate even the fastest 
supercomputers (F. Arute et al. 
Nature 574, 505-510; 2019). We 
consider it irresponsible to 
override the historical context 
of this descriptor, which risks 
sustaining divisions in race, 
gender and class. We call for the 
community to use ‘quantum 
advantage’ instead. 

The community claims 
that quantum supremacy is a
technical term with a specified 
meaning. However, any technical 
justification for this descriptor 
could get swamped as it enters 
the public arena after the intense 
media coverage of the past few 
months. 

In our view, ‘supremacy’ 
has overtones of violence, 
neocolonialism and racism 
through its association with 
‘white supremacy’. Inherently 
violent language has crept into 
other branches of science as 
well — in human and robotic 
spaceflight, for example, terms 
such as ‘conquest’, ‘colonization’ 
and ‘settlement’ evoke the 
terra nullius arguments of 
settler colonialism and must be 
contextualized against ongoing 
issues of neocolonialism. 

Instead, quantum computing 
should be an open arena and an
inspiration for a new generation 
of scientists. 


Carmen Palacios-Berraquero Nu 
Supported by 13 signatories; see
Proven protection
against air pollution 


As researchers for the certifying 
body for personal protective 
equipment in the United 
States, we caution against 
misinterpretation of Wei Huang’s 
and Lidia Morawska’s contention 
that face masks could increase 
health risks from air pollution 
(Nature 574, 29-30; 2019). 
Although the authors attempt 
to distinguish between ‘medical 
masks’ and ‘specialist respirators’, 
a clearer definition of ‘mask’
would avoid confusion over 
the capabilities of different 
protective devices. As they point 
out, surgical masks are loose- 
fitting and ineffective against air 
pollution. However, respirators 
approved by the National 
Institute for Occupational Safety 
and Health (NIOSH) fit tightly 
to the face and filter at least 95%
of airborne particles, including
aerosolized nanoparticulates
(E. Vo et al. Ann. Occup. Hyg. 59,
1012-1021; 2015). 
Even NIOSH-approved 
respirators that have not 
been personally fitted 
provide some protection in
non-occupational settings 
(see go.nature.com/35ztfy). 
Outdoor workers in California, 
for example, wore such 
devices as safeguards against 
non-oily particulate hazards 
produced by this year’s wildfires 
(go.nature.com/35jwdw). 
An absence of evidence from
clinical trials is no reason not to 
take precautionary measures. 


Tyler D. Quinn, Lew Radonovich, 
Maryann D’Alessandro Centers
for Disease Control and 
Prevention, National Institute for 
Occupational Safety and Health, 
Pittsburgh, Pennsylvania, USA. 
yhh7@cdc.gov 

Disclaimer: These conclusions 
do not necessarily represent the 
official position of NIOSH. 
Medical research


Ready-made cellular 
plugs heal skin wounds 


Mark C. Coles & Christopher D. Buckley 


The finding that a thin sheet of fibrous tissue under the skin 
contains a prefabricated, movable cellular sealant that can 
heal deep wounds might have implications for the treatment
of scars and ulcers. See p.287


Skin consists of an outer epidermal layer 
(the epidermis) and an inner dermal 
layer (the dermis). If you pinch your skin, you 
can lift it because these two cellular layers 
move freely above a membranous sheet 
called the fascia, which contains cells and 
extracellular-matrix material. This gelati- 
nous tissue creates a frictionless interface 
between the skin and the more rigid struc- 
tures beneath it, such as muscle and bone. 
However, it now seems that the fascia has 
roles beyond providing a non-stick surface. 
On page 287, Correa-Gallegos et al.¹ report
that the fascia contains a movable sealant 
that patches up deep injuries to enable rapid 
wound repair. 

The scar tissue of a healing skin wound 
contains fibroblast cells, which make and 
modify extracellular-matrix proteins. These 
fibroblasts can be identified by their expres- 
sion of a protein called Engrailed-1, and are 
termed Engrailed-positive fibroblasts (EPFs). 
The idea that the fascia might be a repository 
of cellular components involved in wound 
healing and scar formation came from a pre-
vious study’, which reported that EPFs reside
not only in the skin, as expected, but also in 
the fascia. 

To investigate wound healing in mice, 
Correa-Gallegos and colleagues grafted 
fascia that contained cells engineered to 
express green fluorescent protein onto skin 
cells expressing red fluorescent protein. The 
authors then wounded this dual-coloured 
‘fluorescent sandwich’ and transplanted it 
into a healthy mouse. Comparison of the 
percentages of green and red cells revealed 
that 80% of cells in the healing wound came 
from the fascia. Furthermore, the vast major- 
ity of many cell types found in the healing 
injury originated from the fascia, including
contractile fibroblasts (or myofibroblasts),
blood-vessel cells, macrophages of the 
immune system and nerve cells. 

To confirm that their observations were 
not due to any peculiarities of this artificial 
grafted structure, the authors injected a dye
into the fascia of mice, and then gave the mice 
a deep wound that penetrated the animals’ 
skin and fascia. The authors mapped the 
dye-labelled cells that populated the healing 
wound and the surrounding scar tissue. More 
than half of the cells in the healed wound 
were labelled with the dye, confirming that 
the fascia is a major source of scar-forming 
tissue after deep injury. 


Deep wounds lead to scars that are larger
and harder to heal than those arising from 
superficial wounds that do not penetrate 
the fascia’. The authors used two-photon 
microscopy to analyse deep skin wounds 
in mice engineered‘ to express fluores- 
cent proteins, which can be used to trace 
scar-forming EPFs. They found that a cellular 
plug in the fascia, consisting of extracellu- 
lar matrix, macrophages, blood vessels and 
nerves, moved upwards into the damaged 
skin to form a scar. This healing process did
not require cell division, indicating that the 
plug was prefabricated. Importantly, the 
authors found that key proteins that have 
been reported to define the types of fibro- 
blast found in scars° are expressed at higher 
levels on fascial than on dermal fibroblasts, 
consistent with a model in which fascial EPFs 
are a major source of fibroblasts in healing 
deep wounds (Fig. 1). 

Given that fibroblasts regulate the extra- 
cellular matrix, the authors used microscopy 
to visualize physical features of fibres of 
the protein collagen, which is a component
of the extracellular matrix. Collagen in the 
fascia was more coiled and immature than 
were the stretched and interwoven collagen 
fibres in the dermis. Furthermore, when a 
fluorescent dye was used to tag collagen in 
an injured animal, this revealed that the extra- 
cellular matrix of the fascia moved upwards 
like a pliable gel into the damaged tissue, to
plug and then repair the wound. By contrast,
dermal collagen remained immobile.

Figure 1 | The healing of deep skin wounds. The skin consists of an outer layer called the epidermis and
an inner layer, the dermis. Superficial wounds no deeper than skin level can be repaired by cells called
Engrailed-positive fibroblasts (EPFs) in the dermis, which make extracellular-matrix material. Working
with mice, Correa-Gallegos et al.¹ investigated the healing of deep wounds that penetrated below the
skin into a layer known as the fascia. The fascia contains EPFs, extracellular matrix, blood vessels, nerves and
immune cells called macrophages. The authors report that a prefabricated plug of material from the fascia
moves upwards, steered by fascial EPFs, to seal the wound. (Image based on Fig. 6 of ref. 1.)

The authors then tested whether EPFs from 
the fascia drive the movement of the prefabri- 
cated plug. They inserted non-adhesive mem- 
branes in mice to separate the fascia from the 
dermis, which resulted in delayed repair and 
non-healing wounds that remained open. 
Animals in which these membranes were 
not inserted did not show these effects. The 
removal of fascial EPFs by a genetic approach 
also resulted in the plug not entering wounds 
and in poor healing. These findings indicate 
that fascial EPFs do indeed steer the plug that 
seals deep wounds. 

Although this study has potential relevance 
for human disease, most of the work was 
carried out in an artificial mouse model. 
Moreover, mice have a type of muscle called 
the panniculus carnosus, which lies between 
the fascia and the skin and is used to twitch the
skin®. However, humans lack this twitching 
ability and have only a small remnant of this 
muscle. Therefore, the authors needed to 
determine whether scar formation occurs in 
a similar manner in humans and mice despite 
such differences. 

The team analysed fascial fibroblasts in 
human skin and investigated a type of human 
raised scar called a keloid, which grows big- 
ger than the original injury and can be pro- 
foundly itchy, inflamed and painful’. Many 
of the proteins that characterize the mouse 
fascia were also highly expressed in human 
fascia and keloid scars. This similarity sug- 
gests that the same processes are involved 
in wound healing and scar formation in both 
species. However, it is not yet clear whether 
these findings in mice reveal general princi- 
ples that are relevant to human skin disease. 

The authors’ findings provide satisfying 
potential explanations for some unsolved 
clinical conundrums. Nerves, blood vessels 
and macrophages in the prefabricated plug 
are dragged into the mouse wound; if the 
same phenomenon occurs in humans, this 
could explain why keloids itch and are painful. 
Keloid formation is more common at sites of 
thicker fasciae (such as the chest, back and 
thighs) than at sites where the fascia is thinner 
(for example, the feet), which is consistent 
with a model in which the fascia drives keloid 
formation. 

Could these discoveries about the skin 
shed light on other clinically relevant 
fibrotic diseases (conditions associated with 
the accumulation of extracellular matrix) 
that affect organs in which the fascia is not 
present, such as the lungs and liver? Perhaps
the mechanisms uncovered in mice might 
have relevance for the processes underly- 
ing skin damage in the leg ulcers that can 
develop in people who have diabetes. In 
any case, it is clear that advances made in 
understanding the biology of the fascia 
might reveal new targets for treating scar-
ring diseases of the skin.
Heat transferred in a
previously unknown way 


Karthik Sasihithlu 


Experiments show that quantum fluctuations can allow heat 
to be transported between two objects separated by a vacuum
gap. This effect could be harnessed to exploit and control heat 
transfer in nanoscale devices. See p.243 


Acoustic waves and electromagnetic waves 
can transport heat between objects through 
their respective energy carriers: phonons and 
photons. At or near room temperature, the 
heat transfer between objects separated by 
a material medium occurs at a much higher
rate when facilitated by phonons than by
photons. However, phonons are generally
thought to be ineffective at transporting heat
between objects separated by a vacuum gap,
because these energy carriers are vibrations
in an atomic lattice and thus would require a
material medium to propagate. On page 243,
Fong et al.¹ report experimental evidence that
phonons can travel across a vacuum gap and
therefore induce heat transfer between
vacuum-separated objects because of the effect of
quantum fluctuations.

Figure 1 | Phonon transmission across a vacuum. Fong et al.¹ show that phonons (vibrations in an atomic
lattice) can be transported between objects that are separated by a vacuum gap. To understand how
this process occurs, consider an object at a fixed temperature T₁. Thermal agitation of the object’s atoms
produces phonons that propagate as acoustic waves and cause the object’s surface to exhibit time-varying
undulations (the amplitudes of the undulations shown are exaggerated for clarity). A second object, at
a fixed temperature T₂ < T₁, is brought close to the first object, with a vacuum gap between the objects.
The undulations of the first object’s surface exert a time-varying ‘Casimir’ force (caused by quantum
fluctuations) on the second object’s surface, which gives rise to phonons in the second object. Because
phonons are heat carriers, heat is transferred from the first object to the second one.

In simple terms, quantum fluctuations 
can be understood as being the source of an 
electromagnetic signal that a perfectly sen- 
sitive detector would detect in a vacuum, 
even when this vacuum is shielded from all 
possible internal and external sources of 
electromagnetic waves, such as charges and 
currents”. The fluctuations are a consequence 
of a law in quantum mechanics known as 
Heisenberg’s uncertainty principle*, which 
states that certain pairs of physical quantities 
cannot be determined at the same time with 
absolute precision. The presence of quantum 
fluctuations subtly influences surrounding 
matter, leading to several observable effects. 

One of these effects, relevant to Fong and 
colleagues’ work, is the Casimir force* — the 
force that two neutral atoms separated by a
vacuum gap exert on each other. The Casimir 
force results when quantum fluctuations 
induce fluctuating charge densities in these 
atoms; the charge densities then interact 
through their electric fields. The force that 
sticks a gecko’s foot to a wall is an example of 
a macroscopic manifestation of the Casimir 
force. It arises from the combined interactions 
between fluctuating charge densities in all the 
atomic constituents of the two objects. 

To understand how the Casimir force 
can induce phonon transfer between vacu- 
um-separated objects, consider an object that 
is maintained at a particular temperature by 
being kept in contact with a heat source (Fig. 1).
Thermal agitation of the object’s atoms, which
can be thought of as being interconnected by
elastic springs, gives rise to phonons. In the
presence of these phonons, the surface of the
object undulates over time. When a second
object is brought close to the first one, it is sub-
jected to a time-varying Casimir force owing to
its interaction with the undulations of the first 
object’s surface. The second object’s surface is 
thus subjected to tugging that then gives rise 
to phonons in the object’s interior. Phonons 
are therefore transmitted from the first object 
to the second one. 

Because phonons are heat carriers, when 
they are transported from one object to another
across a vacuum gap, as a result of the Casimir
force, they induce heat transfer if the second 
object is maintained at a lower temperature 
than that of the first one. This phenomenon of 
heat transport facilitated by the Casimir force 
has been predicted previously using theoretical 
models>’. Fong et al. have now measured such
a heat-transfer mode experimentally. 

The authors used a technique called optical 
interferometry to observe the thermal agita-
tion of atoms (Brownian motion) at the surface
of a membrane. This membrane was kept in
contact with a heat source held at a constant 
temperature. Measurements of thermal agi- 
tation can be related to, and therefore used 
as a gauge for, the temperature of the atoms 
at the membrane’s surface. Moreover, the dif- 
ference in this temperature with and without 
Casimir interaction with another, closely jux-
taposed membrane is directly proportional
to the resulting heat transfer between the two
interacting membranes. The authors used 
these features to estimate the amount of heat 
transmitted between the membranes for vac- 
uum gaps of different sizes. They found that 
their measurements accurately conform to 
theoretical estimates of such heat transport. 
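
The sensitivity of this heat-transfer channel to the size of the vacuum gap can be illustrated with a rough numerical sketch. The article gives no formula, so the code below assumes the textbook expression for the Casimir pressure between two ideal, perfectly conducting parallel plates, P = π²ħc/(240d⁴). The function name and the sample gap sizes are hypothetical, and this is not the model against which Fong et al. compared their measurements.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant, in joule seconds
C = 299_792_458.0       # speed of light in vacuum, in metres per second


def ideal_casimir_pressure(gap_m: float) -> float:
    """Magnitude of the attractive Casimir pressure, in pascals, between
    two ideal, perfectly conducting parallel plates gap_m metres apart.

    Assumption: idealized plates at zero temperature. Real membranes
    will deviate from this expression.
    """
    if gap_m <= 0:
        raise ValueError("gap must be positive")
    return math.pi ** 2 * HBAR * C / (240.0 * gap_m ** 4)


if __name__ == "__main__":
    # Illustrative gap sizes only; they are not taken from the experiment.
    for gap_nm in (10, 100, 300, 1000):
        pressure = ideal_casimir_pressure(gap_nm * 1e-9)
        print(f"gap = {gap_nm:5d} nm  ->  pressure ~ {pressure:.3e} Pa")
```

Under this idealized assumption, a tenfold increase in the gap weakens the pressure by a factor of 10,000, which gives a feel for why the gap size matters so much.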

Fong and colleagues’ work provides conclu-
sive evidence that the Casimir force can induce
heat transfer. However, the use of this method
to transport heat between two objects is lim-
ited, because the Casimir force decreases rap-
idly in strength as the space between the objects
is increased. It is only when the gap between
two objects is of the order of a few nanometres
that the Casimir force is strong enough for 
this heat-transfer mode to dominate over 


Structural biology 


competing modes, suchas photon tunnelling’. 

The authors discovered a way to amplify 
the Casimir mode of heat transfer so that it 
remains dominant even when the gap between 
the membranes is in the range of hundreds of 
nanometres. The membranes were carefully 
designed in sucha way that their dimensions 
and the temperatures at which they were main- 
tained allowed them to vibrate with their maxi- 
mum possible displacements — in other words, 
at their natural frequencies. Thus, applications 
that are devised to exploit this heat-transfer 
mode to dissipate heat (suchas ina hard-disk 
drive, where the distance between the writing 
head and the storage disk is a few nanometres) 
would require such careful design to ensure 
that the mode is amplified. Achieving this 
would bea challenge for the future. 
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Malaria parasites fine-tune 
mutations to resist drugs 


Leann Tilley & Philip J. Rosenthal 


Drug resistance in malaria parasites is mediated by mutations 
in a transporter protein. The transporter’s structure reveals
the molecular basis of how key mutations bring about 
resistance to different drugs. See p.315 


About half a million people, most of them 
children living in Africa, are killed each year 
by malaria’. Management of malaria, par- 
ticularly that caused by the highly virulent 
protozoan parasite Plasmodium falciparum, 
is challenged by the emergence of resistance 
to antimalarial drugs’. On page 315, Kim et al.³
report the structure and molecular properties
of a key protein that facilitates resistance, the
P. falciparum chloroquine-resistance trans- 
porter (PfCRT). The structure reveals the 
consequences of finely tuned mutations of the 
amino-acid residues that line a crucial central 
cavity in PfCRT. These mutated residues allow
resistant parasites to transport certain anti- 
malarial drugs away from their site of action — 
and the effect of the mutations is different for 
closely related drugs. 

Malaria parasites spend part of their life 
cycle inside human red blood cells. There, 
they use a specialized membrane-bound com- 
partment known as the digestive vacuole to 
degrade the protein haemoglobin, thereby 
generating amino-acid building blocks for 
growth’. Haemoglobin digestion also pro- 
duces a toxic side product called haem, 
which is exploited by antimalarials from the
quinoline family of drugs (which includes 
chloroquine and piperaquine). These drugs 
bind to the released haem in the digestive 
vacuole and prevent the compound's detoxi- 
fication by the parasite — in effect, poisoning 
the parasite with its own metabolic debris‘. 

Chloroquine’s affordability, safety and 
efficacy made it the drug of choice for com- 
bating malaria, until widespread resistance 
developed in the 1980s. Piperaquine is a 
structurally related compound that retains 
activity against chloroquine-resistant para- 
sites and is currently used in combination with 
another antimalarial drug, dihydroartemis- 
inin. Unfortunately, resistance to piperaquine 
is now widespread in parts of southeast Asia®. 
Paradoxically, piperaquine-resistant parasites 
are often more sensitive to chloroquine than 
are piperaquine-sensitive parasites’. 

Drug resistance can arise because of 
mutations either in the drug’s target or in bio- 
logical machinery that transports the drug 
to or from the target. A landmark report’ in 
2000 identified PfCRT as the main mediator of 
chloroquine resistance in P. falciparum. Resist-
ance emerged independently at different loca-
tions around the world, but is always associated
with a particular mutation in the transporter:
the substitution of a lysine amino-acid residue
by a threonine residue (a K76T mutation). K76T
combines with other geographically specific 
PfCRT mutations to mediate resistance and 
improve the fitness of the mutant parasites® ®. 

PfCRT is a member of the superfamily of 
proteins known as drug/metabolite trans- 
porters? and is located in the membrane of 
the parasite’s digestive vacuole. Structural 
and biochemical studies of related pro- 
teins suggest that resistance to chloroquine 
arises as a result of the transporter passing 
chloroquine out of the digestive vacuole, 
thus removing it from its site of action®. Inter- 
estingly, piperaquine resistance emerges 
when parasites harbouring the K76T mutation 
acquire further mutations°. 

Efforts to design new resistance-busting 
quinoline antimalarials have been stymied 
by a lack of information about the structures
of mutated PfCRT. But in the past few years,
advances in a technique called cryo-electron
microscopy” (cryo-EM) have revolutionized 
structural biology by enabling the direct 
imaging of membrane-embedded proteins, 
such as PfCRT, that are not amenable to study 
using X-ray crystallography. Kim et al. used 
single-particle cryo-EM to determine the 
structure of PfCRT in South American 7G8 
parasites, which harbour mutations that 
confer high-level chloroquine resistance. 

The authors first had to work out a protocol
for efficiently expressing, purifying and 
reconstituting PfCRT into a membrane-like 
environment, to maintain a native conforma-
tion of the protein. PfCRT is relatively small
(49 kilodaltons) compared with proteins that
are typically studied using cryo-EM, and so
Kim et al. prepared an antibody fragment
(known as an antigen-binding fragment, or
Fab) that binds to PfCRT, thereby forming a
complex that has sufficient mass and stability
to allow cryo-EM-based structural elucida-
tion. This approach yielded a structure with
3.2-ångström resolution.

Figure 1 | Mutations affect the transport of antimalarial drugs through the PfCRT protein. a, The
drugs chloroquine and piperaquine target a membrane-bound organelle in the malaria parasite, called
the digestive vacuole. Plasmodium falciparum chloroquine-resistance transporter (PfCRT) resides in the
membrane of the digestive vacuole of the most virulent species of malaria parasite. A positively charged
lysine amino-acid residue (K76) in wild-type PfCRT is thought to repel the positively charged drugs,
preventing transport out of the vacuole (arrow); such transport would lead to drug resistance. b, Kim et al.³ report
the structure of PfCRT that contains a mutation known as K76T. The authors find that K76T alters the charge
distribution of the lining of the PfCRT cavity. Both chloroquine and piperaquine bind to the mutated cavity,
but only chloroquine passes through it (possibly because it has a lower positive charge than piperaquine),
enabling resistance to chloroquine, but not to piperaquine. c, Further mutations in PfCRT (such as the
C350R mutation) cause piperaquine to bind more weakly to the cavity than in b. This potentially underpins
the drug’s ability to pass through PfCRT and leads to piperaquine resistance. In this scenario, the transport
of chloroquine out of the digestive vacuole is less efficient than the transport of piperaquine.

PfCRT was revealed to have ten transmem- 
brane domains and a negatively charged 
central cavity. The cavity opens on to the 
digestive vacuole, but closes about halfway 
through the membrane. The Fab-bound cavity
has an opening 25 Å in diameter, which is large
enough to contain chloroquine (the maxi-
mum dimension of which is about 14 Å) and
piperaquine (with a maximum dimension of
about 21 Å). The structure shows that wild-type
PfCRT has a positively charged lysine residue 
(K76) positioned in the cavity. This residue is 
thought to repel both chloroquine (which has 
two positive charges) and piperaquine (which 
has four positive charges), thereby trapping 
them in the digestive vacuole (Fig. 1a). 

Interestingly, Kim and colleagues’ biochem- 
ical experiments show that PfCRT from 7G8 
parasites can bind both chloroquine and pipera- 
quine, but transport only chloroquine; thus, 
parasites with the K76T mutation are resistant 
to chloroquine, but not piperaquine (Fig. 1b). By
contrast, when the authors introduced further 
mutations to 7G8 that have been observed in 
South American” and southeast Asian* para- 
sites in the past five years, piperaquine efflux 
increased — an effect that was associated with 
decreased sensitivity to piperaquine, but 
increased sensitivity to chloroquine (Fig. 1c).

Kim et al. used their cryo-EM structure to 
carry out molecular modelling and electro- 
static analysis of PfCRT, to help explain why 
mutations can have opposite effects on the 
sensitivity of malaria parasites to chloroquine 
and piperaquine. The modelling suggests that 
mutations associated with piperaquine resist- 
ance can reduce the negative charge or alter 
the conformation of the PfCRT central cavity. 
This might prevent piperaquine from binding 
too tightly to the cavity and thus increase its 
transport out of the digestive vacuole. The 
authors propose that the distribution of sur- 
face charges in the cavity can be fine-tuned 
so that the initial binding of a drug to PfCRT, 
and its subsequent release for transport, is 
different for different drugs, thereby produc- 
ing distinct effects on drug sensitivity. Taken 
together, Kim and colleagues’ findings show 
that P. falciparum is engaged in an ongoing 
balancing act, generating mutations that block 
the action of different drugs while maintaining 
optimal fitness of the parasite. 

A limitation of the new structure is that 
PfCRT is locked in a conformation in which it is
open to the digestive vacuole, as a result of Fab
binding in its central cavity. Further studies 
will be required to work out how drug binding 
couples to the conformational rearrange- 
ments that permit drug transport. 

On a practical level, it is particularly 
important to understand the mechanisms of 
resistance that are likely to arise in Africa, where 
more than 90% of cases of malaria caused by 
P. falciparum occur’. Compared with other parts
of the world, there are fewer PfCRT mutations
in Africa, where resistance to chloroquine is
decreasing in many countries and where the 
combination of piperaquine and dihydroar- 
temisinin remains highly effective”. We can 
look forward to further studies of PfCRT, 
including visualization of the drug-bound 
and open-to-cytoplasm structural conforma- 
tions, which will further explain the effects of 
resistance mutations and might help to identify 
drugs that circumvent resistance. For now, we 
can appreciate the insights gained from Kim 
and colleagues’ beautiful marriage of structure, 
biochemistry, genetics and parasitology, and 
particularly from the first atomic-resolution 
structure of PfCRT — the fine-tuned, resist- 
ance-mediating machine of malaria parasites. 
A step closer to
the Sun’s secrets


Daniel Verscharen 


NASA’s Parker Solar Probe is currently making a series of 
close encounters with the Sun. Initial observations from the 
spacecraft have improved our understanding of both the Sun 
and its environment. See p.223, p.228, p.232 & p.237 


Although the Sun is quite near to us compared
with other stars, it has always kept intrigu- 
ing and fundamental scientific secrets from 
us. For instance, we still don’t know how the 
solar corona — the Sun’s outermost atmos- 
phere — maintains temperatures in excess of 
one million kelvin, whereas the visible sur- 
face has temperatures of just below 6,000 K 
(ref. 1). The corona produces the solar wind, 
an outflow of plasma particles (free ions 
and electrons) that expands into the space 
between the planets. In 2018, NASA launched 
the Parker Solar Probe? (PSP) with the aim of 
identifying the mechanisms behind the heating
of the corona and the acceleration of the
solar wind. Four papers in Nature* ° report 
the first results from the PSP. 

The measurements from the PSP were 
taken when the spacecraft was as close as 
24 million kilometres to the Sun (for compar- 
ison, the average distance between Mercury 
and the Sun is about 58 million kilometres). 
They show that the solar wind near the Sun 
is much more structured and dynamic than 
it is at Earth (Fig. 1). On page 237, Bale et al.
present measurements of the direction and
strength of the Sun’s magnetic field, which
is dragged out into space by the solar wind. 
The authors find rapid reversals in the direc- 
tion of the field that last for only minutes. 




Although some similar magnetic structures 
have been seen before’, the large amplitude 
and the high occurrence rate of these rever- 
sals are surprising. In fact, the nature of these 
structures remains unknown. 

Bale and colleagues also report that the 
PSP’s sensors detected fluctuations in the 
local electric and magnetic fields in the solar 
wind that are larger than those detected near 
Earth. These fluctuations can be generated 
by turbulence in the solar wind or by plasma 
instabilities that are driven by ions or elec- 
trons. The presence of such fluctuations sug- 
gests that plasma instabilities have a much 
larger effect on the dynamics and energetics
of the solar wind than previously expected. 

On page 228, Kasper et al.* present observa- 
tions of the Sun’s plasma ions and electrons. 
They find that the reversals in the Sun’s mag- 
netic field are often associated with localized 
enhancements in the radial component of the 
plasma velocity (the velocity in the direction 
away from the Sun’s centre). The authors 
use the extremely clear signal of the solar 
wind’s strahl — a collimated and fast beam 
of electrons that stream along the magnetic 
field — to study the field’s geometry and con- 
figuration. This method leads Kasper and 
colleagues to interpret the magnetic-field 
reversals as travelling S-shaped bends in the 
field lines coming from the Sun. 

These authors also report a surprisingly 
large azimuthal component of the plasma 
velocity (the velocity perpendicular to the 
radial direction). This component results 
from the force with which the Sun’s rotation 
slingshots plasma out of the corona when the 
plasma is released from the coronal magnetic
field, much like a spinning hammer-thrower
slingshots the hammer when releasing it from
their hands. However, the reason for the large 
observed value of the azimuthal velocity is 
currently unclear. 


[Figure 1 labels: S-shaped bend; magnetic-field line; strahl electron or energetic particle.]

Figure 1| The near-Sun environment. The Sun’s outermost atmosphere generates an outflow of plasma 
particles (ions and electrons) called the solar wind. ‘Strahl’ electrons and energetic particles in the wind 
stream along the Sun’s magnetic-field lines. Four papers® ° report observations from the Parker Solar Probe 
(PSP), which is currently in orbit around the Sun. The PSP data suggest that the field lines contain S-shaped 
bends and that the Sun releases blobs of plasma that form part of the young solar wind. The ultraviolet-light 
image of the Sun was taken by NASA’s Solar Dynamics Observatory on the day that the PSP made its first
close encounter with the Sun.
On page 223, McComas et al. study detec-
tions of energetic ions and electrons, some of
which are observed more often in the region
just outside the corona than they are near 
Earth. These particles are accelerated by 
flares (eruptions of radiation) in the corona
or by shock waves associated with coronal 
mass ejections (eruptions of plasma), which 
travel through interplanetary space. The 
authors identify particles corresponding to 
both types of source region. 

Because energetic particles travel along 
the Sun’s magnetic field, the difference in the
time at which fast and slow particles arrive 
at the PSP can be used to estimate the path 
length of their trajectory along the field. 
McComas and colleagues find that this path 
length is longer than expected, which sug- 
gests that the magnetic field has a more com- 
plicated geometry than assumed. This finding 
could be accounted for by the S-shaped mag- 
netic-field reversals. 
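
The path-length estimate described above can be illustrated with a short calculation. The sketch below is not the authors' analysis: it assumes that all particles are protons released at the same moment and that they travel the same field-aligned distance without scattering, and it uses invented onset times (the names `proton_speed_km_s` and `fit_path_length` are hypothetical). Under those assumptions the arrival time is t = t0 + L/v, so a straight-line fit of arrival time against 1/v gives the path length L as the slope.

```python
import math

C_KM_S = 299_792.458        # speed of light, km/s
PROTON_REST_MEV = 938.272   # proton rest energy, MeV
AU_KM = 1.495978707e8       # astronomical unit, km


def proton_speed_km_s(kinetic_mev):
    """Relativistic speed of a proton with the given kinetic energy."""
    gamma = 1.0 + kinetic_mev / PROTON_REST_MEV
    beta = math.sqrt(1.0 - 1.0 / gamma**2)
    return beta * C_KM_S


def fit_path_length(energies_mev, arrival_s):
    """Least-squares fit of t = t0 + L / v; returns (L in km, t0 in s)."""
    x = [1.0 / proton_speed_km_s(e) for e in energies_mev]
    n = len(x)
    mean_x = sum(x) / n
    mean_t = sum(arrival_s) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxt = sum((xi - mean_x) * (ti - mean_t) for xi, ti in zip(x, arrival_s))
    length_km = sxt / sxx                  # slope of t against 1/v
    release_s = mean_t - length_km * mean_x  # intercept: common release time
    return length_km, release_s


# Synthetic onset times (assumed values): a 0.35 AU path, release at t = 1000 s.
true_length_km = 0.35 * AU_KM
energies = [0.03, 0.05, 0.1, 0.2, 0.5]   # MeV, illustrative channels
onsets = [1000.0 + true_length_km / proton_speed_km_s(e) for e in energies]

length_km, release_s = fit_path_length(energies, onsets)
print(length_km / AU_KM, release_s)
```

With real data the onset times would come from the measured energy channels, and a fitted length well above the nominal spiral length would point to a more tangled field.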

The imaging instrument on board the 
PSP makes remote observations of light 
scattered by electrons and dust near the Sun. 
On page 232, Howard et al.* report that the 
intensity of the dust-scattered light decreases 
with distance from the Sun in almost the same 
way as it does when observed from Earth. 
However, the authors find some preliminary 
evidence for the existence of a hypothesized 
dust-free zone’ near the Sun that has not been 
detected before. The detailed images from the 
PSP also show spatial variations in the solar 
wind that are consistent with variations in the
Sun’s magnetic field on its surface, and reveal
small blobs of plasma that are ejected from
the Sun and form part of the young solar wind.

These four papers show that, by going into 
an unexplored region of the Solar System, the
PSP has already made great discoveries. In 
the near future, it will be important to com- 
bine all the available sources of information 
to develop a deeper understanding of the 
physics of the Sun and the solar wind. For 
instance, researchers should combine the 
measurements of the electric and magnetic 
fields with detailed observations of the 
plasma particles to determine how fields 
and plasma interact and drive instabilities’. 
They must also study the large azimuthal flow 
velocity further to confirm whether it is a per- 
sistent feature or just a one-time exception 
during these initial PSP measurements. 

The use of magnetic-field models will 
enable scientists to learn more about the 
path of energetic particles between the 
Sun and the PSP, and, in turn, about space 
weather — the effects of the Sun and the solar 
wind on Earth and human technology. These 
energetic-particle studies must also be linked 
with remote observations of the Sun’s surface 
and the corona. Examining the potential pres- 
ence of the dust-free zone near the Sun must 
be another short-term goal, but might have to 
wait for closer approaches of the PSP to the
Sun in the future. 

It is expected that PSP data will guide our 
understanding of the Sun and the solar wind 
for many years. New models and theories will 
be motivated by the spacecraft’s discover- 
ies, and this knowledge will be transferable 
to other stars and astrophysical plasmas 
throughout the Universe. After all, the Sun 
is the only star that we can study up close 
using spacecraft. The orbit of the PSP will 
bring the spacecraft even closer to the Sun 
in the coming years, to just over 6 million 
kilometres from the surface’. During this 
time, the Sun will transition into a more active 
phase of its 11-year cycle, so we can expect 
even more-exciting results soon. 

In 2020, the European Space Agency will 
launch the Solar Orbiter mission’®. Although 
this spacecraft will not go quite as close to 
the Sun as will the PSP, its more extensive 
suite of scientific instruments will be used 
in combination with the PSP to reveal key 
information about the Sun. For example, 
Solar Orbiter will measure the elemental
composition and charge states of ions and will
take photographs of the Sun in different wave- 
lengths of light. These joint measurements 
will certainly close some of the remaining 
gaps in our knowledge of the Sun and the solar
wind. For now, however, the Sun has proved 
again that it still holds more secrets for us to 
discover. 
Graphene sees the light in
wearable sensors 


Deji Akinwande & Dmitry Kireev 


Graphene coated with nanoparticles has been used to make 
wearable light sensors that measure the human pulse and 
blood oxygen levels from ambient light passing through tissue, 
offering a potential platform for health-care monitoring. 


The popularity of wearable technology 
has risen enormously, with the US market 
projected to be in the tens of billions of 
dollars by 2022 (see go.nature.com/33tcein). 
However, the effectiveness of the most com- 
mon wearable devices is hindered by the 
physical specifications of their components: 
although the device is often embedded in a
flexible soft shell, the main parts, such as 
the sensors and electronics, are still rigid’. 
Now, writing in Science Advances, Polat et al.* 
report a class of truly flexible, transparent 
wearable device that is based on graphene 
covered with a layer of semiconducting nan- 
oparticles known as quantum dots. Impres- 
sively, the devices measure various vital signs 
using only ambient light as a signal. 
Materials that are just one or a few atoms 
thick are said to be two-dimensional. The 
best-known example is graphene, which 
consists of single sheets of carbon atoms 
arranged in a hexagonal lattice. 2D materials
in general, and graphene in particular, have
tremendous potential for the development 
of next-generation wearable, soft biosensors 
because they combine electrical conductiv- 
ity, optical transparency and mechanical 
flexibility with outstanding biocompatibil- 
ity* and stability to biological electrolytes. 
Graphene-based tattoo-like devices® have 
previously been used to record human health 
signals such as heart rhythm, skin hydration 
and body temperature. Their outstanding 
performance is associated with the subna- 
nometre thickness of graphene, which allows 
it to bend and stretch with the skin, without 
affecting the sensor performance. 

Polat et al. have now expanded the func-
tionality of graphene in wearable devices by
depositing light-sensitive quantum dots made
of the semiconductor lead(II) sulfide (PbS)
onto the graphene layer. When illuminated,
the quantum dots generate pairs of charged
particles: negatively charged electrons and
positively charged holes (quasiparticles asso-
ciated with the absence of an electron in an
atomic lattice). The electrons stay trapped
in the quantum dots, but the holes are trans-
ferred into the graphene layer and increase its
electrical conductivity, producing a measur-
able electrical signal. The authors used this
behaviour to construct light sensors from the
quantum-dot-coated graphene.

Figure 1 | A sensor that detects vital signs using light. a, Polat et al. have used graphene coated with
semiconducting nanoparticles (not shown) to make flexible, transparent devices that can detect the light
transmitted through tissue. Ambient light passes easily through human skin and tissue, but is absorbed
by the haemoglobin in red blood cells. By monitoring the modulation of ambient light as it passes through
tissue, the sensors can thus monitor blood flow and heart rate. The rate of respiration can also be estimated
from mathematical analysis of the blood-flow data. b, Moreover, absorption of red and near-infrared light
by oxygen-rich red blood cells is significantly different from that by oxygen-poor cells. By monitoring the
absorption of red and near-infrared light, the sensors can monitor the oxygen content of the blood.

The researchers observed that the respon- 
sivity (the electrical output per optical input) 
of their devices was remarkably large. The 
high responsivity is attributable to the fact 
that the holes in the graphene layer are recy- 
cled by the quantum dots, effectively increas- 
ing the number of charge carriers generated 
per absorbed photon in the devices — the 
devices are said to exhibit a photoconduc- 
tive gain. 
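
To see what photoconductive gain means for responsivity, a standard textbook relation can be used: R = G η q λ / (h c), where η is the fraction of photons that generate a carrier pair and G is the gain. The sketch below is only an illustration of that relation; the wavelength, efficiency and gain values are placeholders and are not figures reported by Polat and colleagues.

```python
PLANCK_J_S = 6.62607015e-34            # Planck constant, J s
LIGHT_SPEED_M_S = 2.99792458e8         # speed of light, m/s
ELEMENTARY_CHARGE_C = 1.602176634e-19  # elementary charge, C


def responsivity_a_per_w(wavelength_nm, quantum_efficiency, gain=1.0):
    """Photocurrent per unit optical power (A/W) for a detector with gain."""
    photon_energy_j = PLANCK_J_S * LIGHT_SPEED_M_S / (wavelength_nm * 1e-9)
    return gain * quantum_efficiency * ELEMENTARY_CHARGE_C / photon_energy_j


# Assumed example values: red light, 50% efficiency, with and without gain.
no_gain = responsivity_a_per_w(633.0, 0.5)
with_gain = responsivity_a_per_w(633.0, 0.5, gain=1.0e5)
print(no_gain, with_gain)
```

Without gain, the responsivity cannot exceed roughly one ampere per watt at these wavelengths, which is why a detector lacking gain needs an external amplifier.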

Previously reported light sensors typically 
do not have photoconductive gain, and there- 
fore require an amplifier device to boost the 
electrical signal; this increases both power 
consumption and the size of the overall 
device®. Moreover, the amplifier must be 
in close proximity to the sensor, which can 
limit the ability of wearable devices to take on 
the contour of the skin. The intrinsic photo- 
conductive gain of Polat and colleagues’ 
devices eliminates the need for an amplifier, 
solving the above problems and making the 
sensors particularly suitable for real-life 
applications. 

So how were the sensors used to measure 
vital signs? Light at certain wavelengths 
passes easily through human skin and adja- 
cent tissue, but is absorbed strongly by 
blood’ — more specifically, it is absorbed by 
haemoglobin, the molecule that transports 
oxygen in red blood cells. By continuously 
monitoring the intensity of light passing
through tissue, sensors can produce read-
outs called photoplethysmograms (PPGs)
that contain information about volumetric 
changes to blood vessels, which can be corre- 
lated to heart rate’. Polat and colleagues show 
that their wearable devices can, remarkably, 
use the ambient light that passes through 
tissue to measure human heart rates accu- 
rately. Moreover, the sensitivity of the devices 
allowed the researchers to estimate the rate 
of breathing by mathematically analysing 
the PPG data. Physical movements associ- 
ated with breathing usually produce arte- 
facts and noise in the PPG signals detected 
by rigid wearable devices’, but the physical 
unobtrusiveness and flexibility of the new 
devices overcome this problem. 
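
As a rough illustration of how heart and breathing rates can be read from a PPG trace, the sketch below searches for the strongest periodic component in two frequency bands. It is not the authors' algorithm: the signal is synthetic, respiration is modelled simply as a slow additive baseline, and the band limits and the helper name `dominant_frequency` are assumptions made for this example.

```python
import math


def dominant_frequency(samples, fs_hz, f_min, f_max, step=0.01):
    """Return the frequency (Hz) in [f_min, f_max] with the largest spectral power."""
    mean = sum(samples) / len(samples)
    centred = [s - mean for s in samples]
    best_f, best_power = f_min, -1.0
    for k in range(int(round((f_max - f_min) / step)) + 1):
        f = f_min + k * step
        w = 2.0 * math.pi * f / fs_hz
        re = sum(x * math.cos(w * i) for i, x in enumerate(centred))
        im = sum(x * math.sin(w * i) for i, x in enumerate(centred))
        power = re * re + im * im
        if power > best_power:
            best_f, best_power = f, power
    return best_f


# Synthetic 20 s trace sampled at 50 Hz (assumed values):
# a 1.2 Hz cardiac pulse riding on a 0.25 Hz respiratory baseline.
FS_HZ = 50.0
t = [i / FS_HZ for i in range(int(20 * FS_HZ))]
ppg = [1.0 + 0.05 * math.sin(2 * math.pi * 1.2 * ti)
       + 0.02 * math.sin(2 * math.pi * 0.25 * ti) for ti in t]

heart_rate_bpm = 60.0 * dominant_frequency(ppg, FS_HZ, 0.7, 3.0)
breaths_per_min = 60.0 * dominant_frequency(ppg, FS_HZ, 0.1, 0.5)
print(round(heart_rate_bpm), round(breaths_per_min))
```

Motion artefacts would add power across both bands, which is one reason a sensor that moves with the skin makes this kind of analysis easier.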




Polat et al. report that their wearable 
devices can also monitor another vital 
health signal that is often checked by doc-
tors: arterial oxygen saturation (SpO2), which
is the percentage of haemoglobin in blood
that is loaded with oxygen (Fig. 1). Low SpO2
levels can result in loss of consciousness,
impaired mental functions, and respiratory
and cardiac arrest. The absorption of red light
and near-infrared light by oxygen-rich red
blood cells is significantly different from the
absorption by oxygen-free cells. The authors
therefore estimated SpO2 levels by using their
devices to measure light absorption at these 
two wavelengths. 
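
A common way to turn two-wavelength measurements into an oxygen-saturation estimate is the 'ratio of ratios' used in conventional pulse oximetry. The sketch below shows that general idea and should not be read as the calibration used by Polat and colleagues: the signals are synthetic, and the linear mapping (110 minus 25 times the ratio) is a frequently quoted textbook approximation used here purely as a placeholder.

```python
import math


def perfusion_ratio(samples):
    """Pulsatile (AC) amplitude divided by the mean (DC) level of a light signal."""
    dc = sum(samples) / len(samples)
    ac = max(samples) - min(samples)
    return ac / dc


def ratio_of_ratios(red, infrared):
    """Red perfusion ratio divided by the near-infrared perfusion ratio."""
    return perfusion_ratio(red) / perfusion_ratio(infrared)


def spo2_percent(r, intercept=110.0, slope=25.0):
    """Placeholder linear calibration; real sensors are calibrated empirically."""
    return intercept - slope * r


# Synthetic pulse: three cardiac cycles, 100 samples per cycle (assumed values).
pulse = [math.sin(2 * math.pi * i / 100) for i in range(300)]
red = [1.0 + 0.010 * p for p in pulse]        # weaker pulsation in red
infrared = [1.0 + 0.020 * p for p in pulse]   # stronger pulsation in near-infrared

r = ratio_of_ratios(red, infrared)
spo2 = spo2_percent(r)
print(r, spo2)
```

Dividing each pulsatile amplitude by its own mean level removes the dependence on overall light intensity, which matters when the light source is uncontrolled ambient light.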

Finally, Polat and colleagues reported a
further application of their technology: the
monitoring of ultraviolet light. Certain UV 
wavelengths can be harmful to the skin, and 
can potentially even cause cancer”, making it 
desirable to measure UV levels in the environ-
ment. The authors show that their devices can
be integrated with previously fabricated chips 
that enable the sensors to wirelessly transfer 
UV measurements to a mobile phone, thus 
enabling continuous and convenient moni- 
toring of the environmental UV index. 

The reported sensors are all designed to 
communicate wirelessly to any other elec- 
tronics needed for a wearable device, clearly 
separating the soft sensor from any rigid com- 
ponents. But the wireless design requires a 
read-out device (such as a mobile phone) to 
be close to the sensor, which makes it diffi- 
cult to perform long-term monitoring — as 
might be needed for heart-rate monitoring, 
for example. Establishing long-term, contin- 
uous communication between the wearable 
flexible sensors and conventional electron- 
ics will be essential for future applications. 
Alternatively, it might be possible to include 
components that enable memory storage 
and simple digital processing in the flexible 
platform. This could be achieved in the future 
using 2D materials other than graphene”. 

Graphene has now been used as a sensor 
and as a signal transducer in various pro- 
totypes for wearable and mobile health 
devices”. More importantly, however, 
graphene has paved the way for other 2D 
materials to be used in sensors and mobile 
health-monitoring devices. Thousands of 
such materials have been discovered, with 
as-yet unknown properties”. We think that 
the comprehensive study of those materials 
will be essential for the development of future 
biosensors that can be worn by, or even inte- 
grated into, humans. 
NASA’s Parker Solar Probe mission recently plunged through the inner heliosphere of
the Sun to its perihelia, about 24 million kilometres from the Sun. Previous studies 
farther from the Sun (performed mostly at a distance of 1 astronomical unit) indicate 
that solar energetic particles are accelerated from a few kiloelectronvolts up to near- 
relativistic energies via at least two processes: ‘impulsive’ events, which are usually 
associated with magnetic reconnection in solar flares and are typically enriched in 
electrons, helium-3 and heavier ions’, and ‘gradual’ events>*, which are typically 
associated with large coronal-mass-ejection-driven shocks and compressions moving 
through the corona and inner solar wind and are the dominant source of protons with 
energies between 1 and 10 megaelectronvolts. However, some events show aspects of
both processes and the electron-proton ratio is not bimodally distributed, as would
be expected if there were only two possible processes®. These processes have been 
very difficult to resolve from prior observations, owing to the various transport effects 
that affect the energetic particle population en route to more distant spacecraft®. 
Here we report observations of the near-Sun energetic particle radiation environment 
over the first two orbits of the probe. We find a variety of energetic particle events 
accelerated both locally and remotely including by corotating interaction regions, 
impulsive events driven by acceleration near the Sun, and an event related to a coronal 
mass ejection. We provide direct observations of the energetic particle radiation 
environment in the region just above the corona of the Sun and directly explore the
physics of particle acceleration and transport. 


Onboard the Parker Solar Probe (PSP), the instrument suite of the Inte-
grated Science Investigation of the Sun (ISoIS)’ has made the first near-
Sun measurements of solar energetic particles (SEPs). ISoIS comprises
two energetic particle instruments with overlapping coverage, EPI-Hi 
and EPI-Lo, measuring higher- and lower-energy particles’. Together 
they enable ISoIS to explore the near-Sun environment by measuring 
the fluxes, energy spectra, anisotropy, and composition of suprather- 
mal and energetic ions with energies from about 0.02 to 200 MeV per 
nucleon and electrons with energies from about 0.05 to 6 MeV. Here 
we examine this energetic particle environment in the context of in situ 
solar wind’ and magnetic field’ conditions and surrounding density 
structures’? measured by other instruments onboard PSP. 

Figure 1 summarizes ISoIS observations of energetic particles over 
PSP’s first two orbits. Higher- (1-2 MeV) and lower-energy (30-200 keV)
H⁺ ion count rates are plotted on the outside and inside of the orbital
trajectory, respectively. Intensifications indicate energetic particle 
events, with some seen only at higher energies, some only at lower 
energies, and others simultaneously across the combined energy range. 
Figure 1 indicates how rich the ISoIS observations are: a broad array of
different types of particle events are seen at all distances. 

The first large intensification occurred during orbit 1 at higher 
energies with PSP inbound (during interval a, 2018-287 18:00 to 
2018-297 08:20 universal time (UT)) at about 0.5 astronomical units 
(AU). Although not obvious from Fig. 1, this is a corotational event 
also seen when PSP was outbound at about 0.65 AU (during interval
b, 2018-330 23:20 to 2018-341 15:00 UT). Corotating interaction 
regions (CIRs) form as faster solar wind piles up behind slower wind, 
forming a compression”. Because these faster solar wind streams 
emanate from coronal holes at the Sun, CIRs map to nearly fixed
solar longitudes.

Figure 2 shows intervals a and b as a function of the longitude of the
solar surface ‘foot point’ magnetically connected to the spacecraft,
calculated for a nominal Parker spiral with a fixed solar wind speed of
350 km s⁻¹. This calculation combines the rotation of the Sun and the
spacecraft location to show that both events arise from the same, single CIR
structure. These events are ‘dispersionless’: all ions arrive at roughly
the same time and fluctuations in intensity are consistent across ion
speeds. Such events indicate that PSP passed across magnetic flux tubes
that were already filled with high-energy (>1 MeV) particles that move
quickly along magnetic field lines. The intensities of sunward- and
anti-sunward-moving particles in intervals a and b were similar (Fig. 2a),
consistent with a corotating structure that traps particles between a
source more distant than the spacecraft and the increasing magnetic
field strength closer to the Sun. The particle acceleration probably
occurs at reverse shocks, which typically form beyond about 2 AU from
compressions in such CIRs.

The inbound leg towards perihelion 1 (which occurred at 0.17 AU, at
03:28 6 November 2018 UT) was extremely quiet from about 0.4 AU,
providing an ideal opportunity for other PSP instruments to observe
very quiet solar wind conditions with essentially no SEP-produced
penetrating backgrounds. ISoIS began to observe lower-energy SEPs just
before perihelion 1, which then increased. Figure 3 shows the events in
interval c, including low-energy ions ahead of a coronal mass ejection
(CME; Fig. 3b, f, g), the passage of a compression wave after it (Fig. 3c),
and a subsequent higher-energy particle event (Fig. 3a).

ISoIS observations show an SEP event starting early on 2018-315 and
extending to about when the CME arrived at PSP on 2018-316. Particle
anisotropies (Fig. 3f) demonstrate that these particles are streaming
outward from the Sun. The faster particles arrive first, characteristic of
a ‘dispersive’ SEP event (Fig. 3g) with the differing arrival times giving
an estimate of the distance along the magnetic field to the source of
their acceleration. For the time-energy slope in Fig. 3g, we estimate
a path length longer than that of the Parker spiral from PSP at about
0.25 AU, which might be explained by a longer path length associated
with magnetic field ‘switchbacks’ observed by PSP in situ.

Fig. 1 | Observations of energetic particles during orbits 1 and 2. a, b,
Observations of energetic particles (primarily H⁺) at lower (Lo, about
30-200 keV; inside track) and higher energies (Hi, about 1-2 MeV; outside track)
from PSP’s orbit 1 (a) and orbit 2 (b). Intervals without data are indicated by the
black orbital track. Particle intensity is indicated by both the colour and the
length of the bars. Intervals a-d are earmarked for detailed study. The scale
indicated by white rectangles on the outer track is measured in days UT, from
2018-280 (7 October 2018) to 2019-020 (20 January 2019).

Fig. 2 | Recurring corotating energetic particle events. a-d, Corotating ion
events observed in intervals a (blue) and b (red) represented by sunward/anti-sunward
flux time series (a), a count-rate spectrogram (b), the flux as a
function of the magnetic foot point in Carrington longitude and radius from
the Sun (c), and the 1-2 MeV H⁺ rate versus foot-point longitude (d). In c, d,
350 km s⁻¹ is a nominal solar wind speed.

Fig. 3 | CME-related low-energy event and subsequent high-energy event.
a-e, Time series of primarily proton flux at >1 MeV (a), H⁺ flux around
30-500 keV (b), solar wind density (c) and radial speed (d), and magnetic field
vector and magnitude (e) over interval c. f-h, Magnification of the dispersive
SEP event and CME for H⁺ flux around 30-500 keV.
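
The foot-point longitudes used for Fig. 2 come from mapping the spacecraft position back to the Sun along a nominal Parker spiral. A minimal sketch of such a mapping is given below. It is an illustration under stated assumptions, not the procedure used for the figure: the Sun is taken to rotate rigidly with a 25.38-day sidereal period, the wind speed is constant, the finite radius of the source surface is ignored, and the spacecraft longitudes in the example are invented. The function name `footpoint_longitude_deg` is hypothetical.

```python
import math

AU_KM = 1.495978707e8            # astronomical unit, km
SIDEREAL_ROTATION_DAYS = 25.38   # assumed rigid solar rotation period
OMEGA_RAD_S = 2.0 * math.pi / (SIDEREAL_ROTATION_DAYS * 86400.0)


def footpoint_longitude_deg(sc_longitude_deg, r_au, wind_speed_km_s=350.0):
    """Solar longitude of the field line reaching a spacecraft at distance r_au.

    While the plasma travels outward for r / v seconds, the Sun turns by
    omega * r / v, so the foot point lies that far ahead in longitude.
    """
    shift_rad = OMEGA_RAD_S * r_au * AU_KM / wind_speed_km_s
    return (sc_longitude_deg + math.degrees(shift_rad)) % 360.0


# Illustrative spacecraft positions (assumed values, not PSP ephemeris data).
inbound = footpoint_longitude_deg(100.0, 0.50)
outbound = footpoint_longitude_deg(100.0, 0.65)
shift_deg = outbound - inbound
print(inbound, outbound, shift_deg)
```

Because the correction grows with distance, two crossings at different radii can map to the same solar longitude even when the spacecraft longitudes differ.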

Solar observations from the white-light coronagraph on the ‘A’ space-
craft of NASA’s Solar Terrestrial Relations Observatory (STEREO-A)
indicate that the SEP-associated CME started lifting off from the Sun on
2018-314 at about 18:00 UT (Extended Data Fig. 1). Derivation of the CME
speed from STEREO-A imaging (Extended Data Fig. 2) reveals that the
CME was moving slowly (<400 km s⁻¹) from the Sun to PSP, very similar
to the surrounding solar wind speed. By propagating this CME flux rope
at a constant speed of 380 km s⁻¹ from near the Sun to PSP, we find good
agreement with the in situ magnetic field observations. Preliminary
analysis of this event using shock-modelling techniques suggests that
there was probably no shock on the field lines well connected to PSP.
However, a quasi-perpendicular sub-critical shock (Mach number <3)
could have formed over an extended region of the flux rope and perhaps
accelerated the protons measured by PSP (A. Kouloumvakos, private
communication). This energetic particle event was not seen at any of
the 1 AU spacecraft, so such small events may only be observable close
to the Sun and may therefore be much more common than previously thought.
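
The constant-speed propagation mentioned above amounts to simple arithmetic, sketched below. The inputs are taken loosely from the text and the Extended Data captions (the CME leaving the coronagraph field of view at about 15 solar radii around 06:00 UT on 11 November 2018, PSP at about 0.25 AU, a speed of 380 km s⁻¹); purely radial motion is assumed, and the helper name `arrival_time` is hypothetical. It is a consistency check, not the flux-rope model itself.

```python
from datetime import datetime, timedelta

AU_KM = 1.495978707e8       # astronomical unit, km
SOLAR_RADIUS_KM = 6.957e5   # nominal solar radius, km


def arrival_time(start_time, start_r_km, target_r_km, speed_km_s):
    """Arrival time for radial motion at constant speed."""
    travel_s = (target_r_km - start_r_km) / speed_km_s
    return start_time + timedelta(seconds=travel_s)


# Assumed inputs, read approximately from the text (see lead-in).
start = datetime(2018, 11, 11, 6, 0)   # UT
start_r_km = 15.0 * SOLAR_RADIUS_KM
psp_r_km = 0.25 * AU_KM
speed_km_s = 380.0

arrival = arrival_time(start, start_r_km, psp_r_km, speed_km_s)
travel_hours = (arrival - start).total_seconds() / 3600.0
print(arrival.isoformat(), travel_hours)
```

Under these assumptions the transit takes a little under a day, which places the arrival on 12 November 2018, the date quoted for the model prediction.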

At the end of 2018-318, the solar wind speed increased from about
300 to about 500 km s⁻¹ (ref. ®), indicative of a strong dynamic pressure
wave in the solar wind. ISoIS observed a small enhancement in
very-low-energy particles (<50 keV) as this compressional wave passed. This
event is the first direct observation of local energization in the ISoIS
observations. Shocks are not required for particle acceleration and
plasma compressions can accelerate particles provided that the particles
are able to propagate across, but remain close to, the compression.
The large two-step increase in speed on 2018-318 shows that this
compression wave was well on its way to steepening into a
forward-reverse-shock pair, which probably accelerates the higher-energy (>1 MeV)
particles observed from 2018-320 to 2018-324. This is not a CIR as in
intervals a and b, because it has a much narrower range of foot-point
longitudes (see H⁺ count rate at about 300° in Fig. 2c, d) and does not
recur, but instead indicates the interaction of a single fast solar wind
stream, possibly associated with or even magnetically initiated by the
preceding CME. In any case, as with CIR-associated particle events, the
particle isotropy indicates that these ions are trapped on flux tubes,
probably with a source more distant than PSP. In fact, at the time the
second event was seen, about 1-6 days after the passage of the compression
at PSP, the pressure front had expanded outward to heliocentric
distances of about 0.6-2 AU, where the shocks probably formed.

Very near perihelion (about 35 solar radii, R☉) during PSP orbit 2
(interval d), ISoIS observed a unique pair of SEP events (Fig. 4). As PSP
is nearly corotational with the Sun near perihelion, the two events are
magnetically connected to a common solar source <5° apart in longitude.
First, on 2019-092 there was a low-energy dispersive event,
probably associated with an impulsive source in the low corona. Two
days later, on 2019-094, there was a different type of impulsive event,
marked by a substantial increase in ions with >1-MeV energy. Both
events exhibit strong, persistent magnetic-field-aligned ions streaming
away from the Sun.

Fig. 4 | Pair of impulsive events near second perihelion. a-e, Two impulsive
SEP events (during interval d) near the second perihelion of PSP (<40 R☉) at
higher energies (a, b), lower energies (c, d), and the magnetic field measured
during the events (e). |B| is the magnitude of the magnetic field and B_R, B_T and B_N
are the radial, transverse and normal components of the magnetic field,
respectively.

226 | Nature | Vol 576 | 12 December 2019

The first event, starting on 2019-092, may be associated with distur- 
bances also observed in extreme ultraviolet images from STEREO-A in 
the vicinity of active region AR2738, as well as multiple type-III radio 
bursts by both STEREO-A and FIELDS“, the magnetic field instrument 
aboard PSP. This small active region was about 70° off the nominal 
magnetic connection of PSP to the Sun. The fluxes of high-energy 
protons are near background, but we observed a substantial num- 
ber of heavy high-energy ions and at low energies (about 30 keV per 
nucleon). He/H is about 20 times higher than the event on 2019-094, 
and increases in O and Fe abundances are even greater. These results 
suggest that this may be a ‘Z-rich’ event’®; such events are relatively 
rare at 1 AU.

The second SEP event on 2019-094 also exhibits velocity disper-
sion and outward streaming, but has many fewer ions <1 MeV and a
substantial increase at >1 MeV. As with the event on 2019-092, there is
potentially related radio and extreme ultraviolet activity in AR2738. 
However, the heavy ion abundances were similar to more typical SEP 
events. The magnetic field observed at PSP (Fig. 4e) between the two 
events was stronger and considerably smoother than before or after, 
indicating that this was probably a single, lower-β (particle pressure/
magnetic pressure) magnetic structure connecting the two events.
Further, these observations indicate that processes inside 0.17 AU, as 
suggested by early multi-spacecraft studies in solar cycle 20, as well as 
later Helios and STEREO studies” ”, enable fast, direct access of SEPs
to a wide range of solar longitudes. Later studies that combined in situ
data with solar source region observations showed that the smaller, 
longitudinally distributed SEP events are associated with multiple 
jet-like coronal emissions” close to the source region as well as with 
more spatially extended eruptions”. 
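
The ratio of particle pressure to magnetic pressure referred to above is the plasma beta. As a rough illustration of how it responds to a stronger field, the sketch below evaluates it for a single particle species treated as an ideal gas. The density, temperature and field strengths are invented round numbers, not PSP measurements, and the function name `plasma_beta` is hypothetical.

```python
import math

BOLTZMANN_J_K = 1.380649e-23   # Boltzmann constant, J/K
MU_0 = 4.0e-7 * math.pi        # vacuum permeability, T m/A (approximate)


def plasma_beta(density_cm3, temperature_k, b_nt):
    """Thermal pressure n k T divided by magnetic pressure B^2 / (2 mu_0)."""
    thermal_pa = density_cm3 * 1.0e6 * BOLTZMANN_J_K * temperature_k
    magnetic_pa = (b_nt * 1.0e-9) ** 2 / (2.0 * MU_0)
    return thermal_pa / magnetic_pa


# Assumed illustrative values: same plasma, field strength doubled.
beta_weak_field = plasma_beta(200.0, 3.0e5, 80.0)
beta_strong_field = plasma_beta(200.0, 3.0e5, 160.0)
print(beta_weak_field, beta_strong_field)
```

Because magnetic pressure scales with the square of the field strength, doubling the field at fixed density and temperature lowers beta by a factor of four.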

ISoIS observed a rich array of energetic particle phenomena during
PSP’s first two orbits. Several of these events were not observed by 1 AU 
spacecraft, and so small events only observable close to the Sun may 
be much more common than previously thought. With these new data, 
we are well on the way to resolving the fundamental questions of the 
origin, acceleration, and transport of SEPs into the heliosphere. Over 
the next five years, as we head towards solar maximum, PSP will orbit 
progressively closer to the Sun, ultimately extending our exploration 
of these critical processes to less than 10 R☉.
Extended Data Fig. 1 | Viewing geometry and observation of coronal mass
ejection. a, A view of the ecliptic plane from solar north at 14:00 10 November
2018 UT showing the relative positions of STEREO-A and PSP. The dashed
curves, from innermost to outermost, represent the orbits of Mercury, Venus
and Earth. The red area shows the field of view of the COR-2 instrument
onboard STEREO-A. A CME off the east limb of the Sun as viewed from
STEREO-A would be roughly propagating towards PSP. This CME very gradually
entered the field of view of COR-2, part of the SECCHI suite of imaging
instruments aboard the STEREO spacecraft. b, A running-difference image of
the CME taken at 02:39 UT on 11 November 2018 by COR-2A, a visible-light
coronagraph that, extending in the plane of the sky from 2 R☉ to 15 R☉, provided
images during the entire acceleration phase of the CME. This CME entered
COR-2A around 18:00 UT on 10 November 2018 and transited through the COR-2
field of view over about 12 h.
Extended Data Fig. 2 | CME model and comparison to magnetic field data. 
a, The same as in Extended Data Fig. 1b but with the superposed fitted flux-rope 
shape of the CME at 02:39 UT on 11 November 2018, when the CME had passed 
halfway through the COR-2A field of view. The CME is very weak and no 
shock-sheath structure can be identified in these images. The typical aspect of the 
CME in the image results from the line-of-sight integration of the plasma 
distribution on a bent toroid such that its major axis is located in a plane 
containing the observing spacecraft (very similar events have been reported 
previously). b, The position (red) and speed (blue) of the apex of the flux-rope 
model were derived by iteratively comparing each synthetic image produced by the 
three-dimensional model with each available COR-2A image. A functional form 
(arctangent) was imposed for the flux rope's varying speed. The fitted CME 
structure assumed here is a bent toroid with an exponential increase of its 
cross-sectional area from foot point to apex, as in earlier work. The speed was derived 
by fitting a hyperbolic tangent to the modelled CME position. The speed 
increases rapidly from under 100 km s⁻¹ at 18:00 UT on 10 November 2018 to over 
350 km s⁻¹ when it exited the COR-2A field of view at around 06:00 UT on 
11 November. c, An internal magnetic field structure was expressed analytically 
inside the envelope of the fitted CME (smooth curves) as in earlier work, but here 
keeping a simple circular cross-section of the flux rope. By propagating this 
flux rope at a constant speed of 380 km s⁻¹ from the time it exits the COR-2 field 
of view, we predict that the CME reaches PSP on 12 November 2018. The predicted 
arrival time and the magnetic properties of the CME (thick smooth line) are in 
good agreement with those measured in situ by the FIELDS (magnetic field 
data; thin lines) and SWEAP instruments (not shown). We therefore conclude 
that the fitting procedure presented here provides a good description of the 
evolution of the CME from the upper corona to PSP. 
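
The constant-speed propagation step in panel c can be illustrated with a short calculation. This is only a sketch under stated assumptions: the 380 km s⁻¹ speed and the 06:00 UT exit time come from the caption, whereas the exit height (15 R☉, the outer edge of the COR-2A field of view) and the PSP distance (54 R☉) are assumed values chosen for illustration, and the function name is hypothetical.

```python
from datetime import datetime, timedelta

R_SUN_KM = 695_700.0  # nominal solar radius in km


def arrival_time(exit_time, exit_r_rsun, target_r_rsun, speed_km_s):
    """Ballistic arrival time of a front moving radially at constant speed."""
    distance_km = (target_r_rsun - exit_r_rsun) * R_SUN_KM
    return exit_time + timedelta(seconds=distance_km / speed_km_s)


# Caption values: exit near 06:00 UT on 11 November 2018, 380 km/s afterwards.
# Assumed values: exit height of 15 R_sun and a PSP distance of 54 R_sun.
exit_time = datetime(2018, 11, 11, 6, 0)
eta = arrival_time(exit_time, exit_r_rsun=15.0, target_r_rsun=54.0,
                   speed_km_s=380.0)
travel_hours = (eta - exit_time).total_seconds() / 3600.0
print(f"travel time: {travel_hours:.1f} h, arrival: {eta:%Y-%m-%d %H:%M} UT")
```

Under these assumptions the travel time is roughly 20 h, which places the arrival early on 12 November, in line with the prediction quoted in the caption.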
The prediction of a supersonic solar wind was first confirmed by spacecraft near 
Earth and later by spacecraft at heliocentric distances as small as 62 solar radii. 
These missions showed that plasma accelerates as it emerges from the corona, aided 
by unidentified processes that transport energy outwards from the Sun before 
depositing it in the wind. Alfvénic fluctuations are a promising candidate for such a 
process because they are seen in the corona and solar wind and contain considerable 
energy. Magnetic tension forces the corona to co-rotate with the Sun, but any 
residual rotation far from the Sun reported until now has been much smaller than the 
amplitude of waves and deflections from interacting wind streams. Here we report 
observations of solar-wind plasma at heliocentric distances of about 35 solar radii, 
well within the distance at which stream interactions become important. We find that 
Alfvén waves organize into structured velocity spikes with duration of up to minutes, 
which are associated with propagating S-like bends in the magnetic-field lines. We 
detect an increasing rotational component to the flow velocity of the solar wind 
around the Sun, peaking at 35 to 50 kilometres per second, considerably above the 
amplitude of the waves. These flows exceed classical velocity predictions of a few 
kilometres per second, challenging models of circulation in the corona and calling 
into question our understanding of how stars lose angular momentum and spin down 
as they age. 


The Parker Solar Probe (PSP) launched in August 2018 on a Delta IV 
Heavy rocket. The high energy of the launch, combined with a gravitational 
assist from Venus in September 2018, placed PSP into an eccentric 
orbit with a period of 147 days and a perihelion at a heliocentric distance 
of r = 35.7 R☉ (R☉, solar radius), nearly a factor of two closer to the Sun 
than any previous mission. This study uses observations made by 
instruments on the spacecraft during the first two encounters with the 
Sun, in November 2018 and April 2019. Whereas the instruments collect 
data at a low rate far from the Sun, the primary science collection at a 
high rate occurs during the encounter phase of each orbit at r < 54 R☉ 
(0.25 AU). Encounter one (E1) lasted from 31 October to 12 November 
2018, with the first perihelion occurring at 03:27 UT on 6 November. 
During these two encounters the longitude of PSP relative to the rotating 
surface of the Sun barely changed; PSP essentially dove down into, 
and then rose straight up from, a single narrow region above the Sun. 
E1 and E2 data thus describe a handful of specific solar-wind streams. 

Nearly two million thermal-energy distribution functions of the solar-wind 
protons were recorded during E1, and more than three times that 
number during E2 (Fig. 1, Extended Data Fig. 1). From these distribution 
functions, the bulk properties of solar-wind protons (such as the velocity, 
density and temperature) are derived. Within any hour interval, the 
distribution of the radial solar-wind speed, V_pR, was strongly peaked at a 
Fig. 1 | Overview of the first encounter of PSP with the Sun. a, Relative 
occurrence rate of the proton radial speed V_pR in one-hour intervals. Red 
triangles show the start and end of the high-rate data collection below 54 R☉ and 
the green triangle indicates a perihelion at 35.7 R☉. b-g, The same for V_pT in the 
solar equatorial plane (b), the proton number density n_p (c), the proton 
temperature T_p (d), the radial component of magnetic field B_R (e), the electron 
pitch-angle distribution (PAD) (f) and the 20-200 keV proton rate (g). The date 
(month/day), distance r and latitude relative to the solar equator are indicated 
at daily intervals. 
Fig. 2 | Solar-wind fluctuations near the closest approach. Near-Sun 
fluctuations meet the Alfvénic criteria, but are organized into structures and 
contain density enhancements. a, Magnitude of V_pR (blue) and angle θ_BR of B 
from the radial direction outwards. b, Magnitudes of n_p (green), B (red) and the 
proton thermal speed w_p (yellow). c-e, Variation of each vector component of 
the velocity (blue) and magnetic field (red) in the radial direction (R), 
the transverse direction in the solar equatorial plane (T) and the normal to R 
and T (N). Since the orbit of PSP is within a few degrees of the solar equator, 
N points approximately north, perpendicular to the equatorial plane. There is a 
baseline solar-wind speed of about 300 km s⁻¹ and jets where V_pR jumps by about 
100 km s⁻¹. The fluctuations are highly Alfvénic, with equal energy in the field 
and the flow, but they are organized into structures instead of being randomly 
distributed, and there is evidence of compressions. 
Fig. 3 | A closer look at a velocity spike. The formatting is as in Fig. 2, but 
focused on a single 1,000-s interval. The left blue region indicates the 105-s 
period in which PSP moved from the ambient plasma into the spike. The central 
core of the spike is indicated by the grey region and lasted for 325 s; it is 


minimum value, with a one-sided tail extending to larger V_pR values. V_pR 
reached its minimum of 200 km s⁻¹ about a quarter of the way through 
E1 and then rose steadily to about 600 km s⁻¹. Numerical simulations 
and simple extrapolations of the observed photospheric magnetic field 
suggest that PSP spent all of E1 south of the global heliospheric current 
sheet, in a region with inward magnetic polarity (B_R < 0). Near the start 
and end of E1, PSP sampled slow wind from near the global heliospheric 
current sheet. Closer to the Sun, PSP first observed very slow wind and 
then fast wind, both of which are thought to emerge from a low-latitude 
coronal hole. Below 40 R☉, V_pT (the transverse component of the proton 
velocity in the equatorial plane of the Sun) has a net positive value, 
which peaks at the closest approach. This flow may be the long-sought 
signature of plasma co-rotation in the corona. The density peaks in the 
slowest wind, at a value of approximately 400 cm⁻³, about 50 times 
higher than values typically observed at 1 AU, as expected from mass 
conservation and spherical expansion. The proton temperature, T_p, and 
V_pR remain positively correlated. At perihelion the protons are about 4 
times hotter than protons with similar V_pR at 1 AU, consistent with radial 
scalings reported from earlier missions. The radial component of the 
magnetic field, B_R, increases in magnitude with proximity to the Sun but 
unexpectedly changes sign many times. The pitch-angle (θ) distribution 
for electrons (that is, the number of electrons at a given energy as 
a function of their angle relative to B) is a valuable diagnostic of these 
changes in the direction of B. Here we show the pitch-angle distribution 
in a 22-eV-wide energy channel centred on 314 eV, well above the 
electron thermal energy. The sharp peak near 180° corresponds to the 
strahl, a beam of super-thermal electrons that travel away from the Sun 
along magnetic-field lines. Near the Sun, the strahl evolves towards 
small sin θ values because of magnetic-moment conservation. If the 
reversals in B_R seen by PSP result from the spacecraft's crossing between 
open field lines (connected to the Sun at only one end) with different 
characterized by a steady but disturbed flow and a field with a large rotation in 
B to θ_BR ≈ 70° and a jump in flow to 343 km s⁻¹. The return from the core spike into 
ambient solar wind is marked by the second blue region and lasted 30 s. 


signs of B_R back at the Sun, then the strahl would flip between 180° and 
0° each time B_R changed sign. Instead, every time B_R flips, the strahl 
maintains its 180° orientation, clearly indicating that the reversals in 
B_R are due to S-like bends in the magnetic-field lines (Extended Data 
Fig. 2). Closed field lines with both ends connected to the Sun and strahl 
travelling in both parallel and antiparallel directions to B are seen during 
the arrival of a coronal mass ejection on 11 November at 23:50 UT, 
following an enhancement in the number of energetic particles. 

Figure 2 shows a time series of 80 min of observations several hours 
after perihelion, illustrating typical velocity and magnetic-field fluctuations. 
About half the time, B points radially inwards towards the 
Sun and the velocity V remains at a relatively constant 300 km s⁻¹. The 
remaining time includes seven distinct intervals in which B rotates away 
from its radial-inward orientation, V_pR simultaneously jumps and V also 
rotates, linking the one-sided tail in V_pR with the reversals in polarity 
seen in the E1 overview. These jumps in flow associated with rotations 
in B and V are similar to one-sided Alfvénic structures that were first 
seen farther from the Sun. The spikes seen by PSP are different in that 
they have larger amplitudes and are often associated with an increase 
in the proton density, n_p, indicating that the spikes have a non-Alfvénic 
component. The correlated variations in the components of B and V, 
their relative amplitudes and the constant value of |B| are consistent 
with large-amplitude, spherically polarized Alfvén waves propagating 
through the plasma in the anti-Sunward direction, similar to earlier 
observations. We can classify this wind stream (and indeed much 
of E1) as Alfvénic slow solar wind. 

About 1,000 long-duration (>10 s) and isolated velocity spikes with 
large rotations in B were identified in E1 (about half as many were seen 
in E2). Often the spikes can be separated chronologically into a core 
region with plasma conditions that are very different from those of 
the ambient solar wind but relatively constant, a comparatively short 
Fig. 4 | Large circulation of solar wind observed near the Sun. Averaged 
rotational (or azimuthal) flow, V_pT, over 1.75 R☉ intervals during E1 (inbound in 
blue, with error bars indicating the standard deviation from all observations; 
outbound in purple) and E2 (inbound in red; outbound in yellow) as a function 
of radial distance. Each symbol represents the average over at least 10,000 
observations, and the values closest to perihelion are averaged over 60,000-230,000 
observations. Error bars for E1 inbound show one standard deviation 
of the individual observations and are representative of the variation for the 
other three phases. The uncertainty in the mean of V_pT is much smaller than the 
marker size. Current and upcoming perihelia are shown with green triangles. 
Lines indicate no rotation (dashed), rigid co-rotation everywhere (red) and the 
axisymmetric Weber-Davis model (blue). 


transition region on one side of the core, and a longer transition region 
on the other side containing large-amplitude fluctuations (see Fig. 3). 
During the 105-s transition at the beginning of this spike, the flow underwent 
seven large oscillations with an amplitude of 150 km s⁻¹, which 
possibly resulted from a Kelvin-Helmholtz instability. 

Equally unexpected as the spikes and B_R reversals are the large-amplitude 
and sustained positive rotational velocities seen below 40 R☉ for 
E1 and 50 R☉ for E2 (Fig. 4). Net rotation has been reported farther from 
the Sun, but it was of the same order as instrument error and much 
smaller than the standard deviation in flow due to fluctuations and 
stream interactions. Here V_pT rises to 35 km s⁻¹ (E1) and 50 km s⁻¹ (E2). 
This is much greater than the variance from fluctuations including the 
velocity spikes, there is no evidence of stream interactions, and these 
values are much greater than the precision in the averaged flows (less 
than 0.5 km s⁻¹) and the absolute error in the flow due to a pointing error 
(less than 3 km s⁻¹) (see Methods). These are the first in situ observations 
of net rotational flow in the solar wind that are significantly above 
fluctuations and uncertainty. 
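
For scale, the measured rotational speeds can be compared with the speed that plasma would have if it rotated rigidly with the Sun out to the perihelion distance. The sketch below is illustrative only: the sidereal equatorial rotation period of about 25.4 days and the nominal solar radius are assumed values that do not come from the text, and the function name is hypothetical.

```python
import math

R_SUN_KM = 695_700.0          # nominal solar radius in km
ROTATION_PERIOD_DAYS = 25.4   # assumed sidereal equatorial rotation period


def rigid_corotation_speed(r_rsun, period_days=ROTATION_PERIOD_DAYS):
    """Azimuthal speed (km/s) of plasma rotating rigidly with the Sun."""
    omega = 2.0 * math.pi / (period_days * 86_400.0)  # angular rate in rad/s
    return omega * r_rsun * R_SUN_KM


v_rigid = rigid_corotation_speed(35.7)  # perihelion distance from the text
print(f"rigid co-rotation at 35.7 R_sun: {v_rigid:.0f} km/s")
```

With these assumptions rigid co-rotation at perihelion would correspond to roughly 70 km s⁻¹, so the observed 35 to 50 km s⁻¹ is a substantial fraction of that limit.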

Some level of rotational flow has always been expected in the solar 
wind near the Sun, as magnetic tension in the corona should force 
the plasma to rotate as the Sun spins. However, the large rotational 
velocities measured here greatly exceed the value calculated by the 
axisymmetric Weber-Davis model, posing a major challenge to our 
understanding of the dynamics of the near-Sun solar wind. Determining 
the origin of these tangential flows will be essential for understanding 
how the Sun loses angular momentum and spins down as it ages. 
Further studies of the angular momentum should include magnetic 
fields, waves and different ions. Future PSP orbits will clarify the extent 
to which these large rotational flows characterize other solar-wind 
streams. These orbits will also provide critical additional diagnostics 
of the state of the plasma, including turbulence, velocity spikes, temperature 
anisotropy and particle velocity distribution functions, at 
heliocentric distances as small as 9.86 R☉. 
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Methods 


Data collection and analysis 

The data presented here were collected over the course of the first two 
encounters of the Sun by PSP in November 2018 and April 2019. This 
study makes use of all of the in situ instruments on the spacecraft. The 
thermal plasma properties are measured by the PSP SWEAP (Solar Wind 
Electrons Alphas and Protons) instrument suite, including the Solar 
Probe Cup (SPC) and the Solar Probe Analyzers (SPAN) for electron and 
ion plasma data. Magnetic-field data from the outboard FIELDS magnetometer 
were also used, along with energetic-particle rates as seen 
by IS☉IS. SPC measures the reduced distribution function of ionized 
hydrogen and helium and the two-dimensional flow angles of the ions 
as a function of energy/charge. These measurements were performed 
at least once per second and typically more than four times per second 
throughout the encounter phase of each orbit (below 0.25 AU or 54 R☉). 
This paper uses moments of the entire SPC proton distribution func- 
tion to calculate the total effective proton velocity, density and radial 
component of the temperature. While the SPAN ion sensor generally 
did not view the peak of the proton velocity distribution, the overlap- 
ping regions seen by SPAN and SPC were compared to confirm that 
there were no gross offsets in the calibration or the derived plasma 
properties such as the velocity; this technique will be more accurate 
when the solar wind flows into SPAN closer to the Sun. Observations 
of electrons with a central energy of 314 eV and a width of 22 eV by 
the two SPAN electron sensors were combined, along with the FIELDS 
determination of the magnetic-field direction, to create the electron 
pitch-angle distributions. 

All data are being archived and will be available for download at 
the NASA Space Physics Data Facility in November 2019 
(https://spdf.gsfc.nasa.gov/). Additional SWEAP data and information are 
available at the SWEAP web page (http://sweap.cfa.harvard.edu/). 
Data were analysed and graphics were developed in IDL (Interactive 
Data Language). 


Statistics 

The distributions of plasma properties in Fig. 1 and Extended Data Fig. 1 
were produced with a time resolution of 1 h. During the encounters, 
the time resolution of the plasma instrument ranged from slightly 
more than one measurement per second to more than four measurements 
per second, so each column represents the distribution of 
approximately 3,600-14,400 measurements. All error bars indicate 
one standard deviation of the measurements from the mean. At least 
10,000 (and generally more than 80,000) observations were used to 
calculate the mean transverse flow V_pT in Fig. 4. 
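
The counts quoted above follow directly from the sampling rate, and they also explain why the uncertainty in a mean flow is far smaller than the scatter of individual measurements. The following sketch makes both points; the 30 km s⁻¹ scatter is a purely illustrative value (it is not taken from the text), and the helper names are hypothetical.

```python
import math


def measurements_per_column(rate_hz, column_seconds=3600):
    """Number of samples that fall into one histogram column."""
    return int(rate_hz * column_seconds)


def standard_error(sigma, n):
    """Standard error of the mean of n independent samples with scatter sigma."""
    return sigma / math.sqrt(n)


low_count = measurements_per_column(1)    # about one sample per second
high_count = measurements_per_column(4)   # about four samples per second

sigma_km_s = 30.0                         # hypothetical scatter, for illustration
sem_km_s = standard_error(sigma_km_s, 10_000)
print(low_count, high_count, sem_km_s)
```

With at least 10,000 samples per average, even a scatter of tens of km s⁻¹ yields a sub-km s⁻¹ uncertainty in the mean, provided the samples can be treated as independent.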


Estimates of uncertainty 

Here we discuss the absolute accuracy of SPC ion measurements. As verified 
by ground testing, the absolute accuracy for V_pR is better than 0.01% 
over a measurable range of approximately 119 km s⁻¹ to 1,065 km s⁻¹. The 
absolute accuracy in temperature is similarly small over a measurable 
range of approximately 7.3 kK to 21.1 MK (that is, proton thermal speeds 
of 11 km s⁻¹ to 600 km s⁻¹). Speeds and temperatures at the extremes 
of these ranges are subject to systematic considerations, but no such 
measurements are presented here. The accuracy of the density measurement 
is determined by comparison with the plasma frequency as 
observed by FIELDS. Thus, the absolute accuracy of the SPC density 
measurement is estimated to be about 1% and is no worse than 3%. 
The absolute accuracy for off-radial flow components is verified via 
spacecraft roll manoeuvres about the SPC symmetry axis. For solar-wind 
fluxes typical of the first two encounters, the uncertainty associated 
with this calibration corresponds to a typical absolute accuracy 
of about 0.5°. For a solar wind of 400 km s⁻¹ this corresponds to an 
expected error in V_pT of 3-4 km s⁻¹, which is much smaller than the net 
rotational flow observed. 
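
The conversion from a pointing error to a transverse-velocity error is simple trigonometry: a radial flow of speed V seen through an angular offset δ acquires an apparent transverse component V tan δ. A minimal sketch of that arithmetic (the function name is hypothetical):

```python
import math


def transverse_velocity_error(radial_speed_km_s, pointing_error_deg):
    """Apparent transverse speed produced by an angular pointing offset."""
    return radial_speed_km_s * math.tan(math.radians(pointing_error_deg))


# Values quoted in the text: 400 km/s wind and a 0.5 degree pointing accuracy.
error_km_s = transverse_velocity_error(400.0, 0.5)
print(f"expected transverse error: {error_km_s:.1f} km/s")
```

The result lies between 3 and 4 km s⁻¹, matching the range quoted above.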


Signatures of Alfvénic fluctuations 

In discussing Fig. 2, we stated that the correlation of fluctuations in the 
components of B and V was generally indicative of outward-propagating 
Alfvén waves. We consider the vector waves or fluctuations ΔV and 
ΔB superimposed on a steady background of B₀ and V₀, respectively. In 
the long-wavelength fluid magnetohydrodynamic limit, Alfvén waves 
propagate exactly in parallel or antiparallel directions to B₀, are 
dispersionless and do not compress the plasma, and there is a simple linear 
relationship of $\Delta\mathbf{B} = \pm D_A\,\Delta\mathbf{V}$, where 
$D_A = (n_p + 4n_\alpha)^{0.5}\,\Theta/21.8$ (in units of 
nT km⁻¹ s; densities are in units of cm⁻³) and 
$\Theta = (1 - \beta_\parallel + \beta_\perp)^{0.5}$. Here Θ is 
a correction for thermal pressure anisotropy, where β∥ is the ratio of the 
parallel plasma pressure to the magnetic pressure and β⊥ is the ratio of 
the perpendicular plasma pressure to the magnetic pressure. For this 
period we find on average n_p = 220 cm⁻³, β∥ = 0.202 and β⊥ = 0.315. SPC 
and SPAN were not configured optimally to measure the ionized helium 
abundance n_α, so assuming the typical range 0.5% < n_α/n_p < 4.5%, we 
expect D_A = 0.68-0.74 nT km⁻¹ s. We find D_A for the R, T, N components 
to be 0.71, 1.09 and 0.70 nT km⁻¹ s, respectively, so the R and N components 
are within the expected range and the fluctuations in the 
T direction are about 33% higher (it is typical for D_A to be different for 
each component of the velocity). We then used the calculated value of 
D_A to rescale the range of the vector components of B, so they should 
overlap with V if the fluctuations were purely Alfvénic. The sign of the 
relation between ΔB and ΔV is given by the sign of −k·B₀, where k is the 
wavevector and gives the direction of propagation, and B₀ is an average 
direction of the field over a long time scale. The ambient direction of 
the magnetic field outside the large-amplitude fluctuations points 
towards the Sun and the correlations are overwhelmingly positive, 
meaning that we are seeing outward-propagating waves. 
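
The expected range of the proportionality constant can be checked numerically from the quoted proton density and helium abundance range. The sketch below is a simplification rather than the authors' calculation: it evaluates only the density term and leaves the anisotropy correction at 1 by default (an assumption made here for illustration). The constant 21.8 is the usual factor in the Alfvén speed expressed in km s⁻¹ for a field in nT and a density in cm⁻³. The function name is hypothetical.

```python
import math

ALFVEN_CONSTANT = 21.8  # km/s per (nT / sqrt(cm^-3)) for a proton plasma


def alfven_ratio(n_p_cm3, alpha_fraction, anisotropy_factor=1.0):
    """Ratio of magnetic to velocity fluctuation amplitudes, in nT per (km/s).

    n_p_cm3           proton number density in cm^-3
    alpha_fraction    helium-to-proton number ratio (each alpha weighs 4 protons)
    anisotropy_factor pressure-anisotropy correction, left at 1 in this sketch
    """
    mass_density = n_p_cm3 * (1.0 + 4.0 * alpha_fraction)
    return math.sqrt(mass_density) * anisotropy_factor / ALFVEN_CONSTANT


d_low = alfven_ratio(220.0, 0.005)   # 0.5% helium
d_high = alfven_ratio(220.0, 0.045)  # 4.5% helium
print(f"D_A range: {d_low:.2f} to {d_high:.2f} nT s/km")
```

The density term alone spans approximately the range quoted in the text, which suggests that the anisotropy correction is a modest adjustment for these plasma conditions.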


Identification of velocity spikes 

Isolated velocity spikes were identified by looking for all intervals in 
each encounter in which the orientation of the magnetic field started 
in the quiet configuration pointed towards the Sun within 30°, rotated 
more than 45° away from the quiet configuration for at least 10 s, and 
then returned to the original direction. Candidate events were then 
examined manually to identify starting and ending times. 
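
The selection criteria above can be sketched as a small state machine over a time series of field angles. This is an illustrative reading of the criteria, not the authors' code: the function name, the input format (the angle of B from the quiet sunward direction, in degrees) and the choice to tolerate brief dips between 30° and 45° inside an event are all assumptions.

```python
import numpy as np


def find_spike_candidates(t, theta_deg, quiet_max=30.0, rotated_min=45.0,
                          min_duration=10.0):
    """Return (start, end) times of candidate spikes.

    t          sample times in seconds, increasing
    theta_deg  angle between B and the quiet sunward direction, in degrees
    A candidate begins when the field leaves a quiet state (angle < quiet_max)
    and exceeds rotated_min, and ends when the field is quiet again. It is kept
    only if the rotated samples span at least min_duration seconds.
    """
    events = []
    state = "unknown"          # no event may start before a quiet sample is seen
    start = last_rotated = None
    for ti, th in zip(np.asarray(t, float), np.asarray(theta_deg, float)):
        if state == "rotated":
            if th > rotated_min:
                last_rotated = ti
            elif th < quiet_max:               # back in the quiet configuration
                if last_rotated - start >= min_duration:
                    events.append((float(start), float(ti)))
                state = "quiet"
        elif th < quiet_max:
            state = "quiet"
        elif th > rotated_min and state == "quiet":
            state, start, last_rotated = "rotated", ti, ti
    return events


# Synthetic example: one 15-s rotation (kept) and one 5-s rotation (rejected).
t = np.arange(60.0)
theta = np.full(60, 10.0)
theta[10:25] = 90.0
theta[40:45] = 90.0
candidates = find_spike_candidates(t, theta)
print(candidates)
```

As in the text, such an automatic pass would only produce candidates; the start and end times would still need manual inspection.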
Extended Data Fig. 1 | Overview of the second PSP encounter with the Sun. 
The figure is in the same format as Fig. 1. Spikes in the velocity are again seen to 
be coincident with the magnetic-field reversals, but the jump in the speed is 
smaller, probably because the Alfvén speed was lower in E2 than E1. The density 
at perihelion is substantially lower. 
Extended Data Fig. 2 | Schematic of an S-shaped magnetic structure creating 
a field reversal, heat-flux reversal and a spike in velocity. This figure 
illustrates the possible geometry of an S-shaped propagating Alfvénic 
disturbance (grey box) and how it would appear to the spacecraft (black 
square) as it flew through the spike on the green trajectory. The pink lines with 
arrows indicate the configuration of the magnetic field, with all field lines 
ultimately pointing back to the Sun. Arrows at each black square indicate the 
vector velocity (blue), electron strahl (orange) and magnetic field (red) seen by 
the spacecraft. If this was a purely Alfvénic structure, then the spike would 
move away from the Sun in an antiparallel direction to B at the local Alfvén 
speed, C_A. In the frame of the spike, the shape of the structure would be static, 
with plasma flowing in along field lines on the upper left and through the spike 
and emerging at the lower right, always flowing at C_A. In the frame of the 
spacecraft, the constant flow along field lines in the propagating spike frame 
would translate into a radial increase of V by C_A when B is perpendicular to the R 
direction, and a maximum jump of 2C_A when B is completely inverted. Because 
the heat flux flows away from the Sun along magnetic field lines, it would rotate 
so as to always be antiparallel to B and appear locally to be flowing back to the 
Sun at the centre of this disturbance. 
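
The two velocity jumps quoted in this caption follow from a simple change of reference frame. In the sketch below, the symbols V_0 (radial speed of the ambient wind) and θ (rotation angle of B away from its quiet, sunward orientation) are notation introduced here for illustration. In the spike frame the plasma streams along the field at C_A, so in the ambient wind (θ = 0) it moves sunward at C_A relative to the structure, and the structure itself therefore moves outwards at V_0 + C_A.

```latex
\begin{aligned}
V_{\text{frame}} &= V_0 + C_A, \\
V_R(\theta) &= V_{\text{frame}} - C_A\cos\theta = V_0 + C_A\,(1 - \cos\theta), \\
\Delta V_R(90^\circ) &= C_A, \qquad \Delta V_R(180^\circ) = 2\,C_A .
\end{aligned}
```

The radial speed therefore rises by C_A when the field is perpendicular to the radial direction and by at most 2C_A when the field is fully reversed, as stated in the caption.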
Remote observations of the solar photospheric light scattered by electrons (the 
K-corona) and dust (the F-corona or zodiacal light) have been made from the ground 
during eclipses and from space at distances as small as 0.3 astronomical units to the 
Sun. Previous observations of dust scattering have not confirmed the existence of 
the theoretically predicted dust-free zone near the Sun. The transient nature of the 
corona has been well characterized for large events, but questions still remain (for 
example, about the initiation of the corona and the production of solar energetic 
particles) and for small events even its structure is uncertain. Here we report 
imaging of the solar corona during the first two perihelion passes (0.16-0.25 
astronomical units) of the Parker Solar Probe spacecraft, each lasting ten days. The 
view from these distances is qualitatively similar to the historical views from ground 
and space, but there are some notable differences. At short elongations, we observe a 
decrease in the F-coronal intensity, which is suggestive of the long-sought 
dust-free zone (refs 9-11). We also resolve the fine-scale plasma structure of very small 
eruptions, which are frequently ejected from the Sun. These take two forms: the 
frequently observed magnetic flux ropes and the predicted, but not yet observed, 
magnetic islands (refs 17, 18) arising from the tearing-mode instability in the 
current sheet. Our 
observations of the coronal streamer evolution confirm the large-scale topology of 
the solar corona, but also reveal that, as recently predicted, streamers are composed 
of yet smaller substreamers channelling continual density fluctuations at all visible 
scales. 


The Parker Solar Probe (PSP) carries an imaging instrument, the Wide-field 
Imager for Solar Probe (WISPR). The inset in Fig. 1a shows a WISPR 
inner telescope (WISPR-I) image taken on 6 November 2018 at the first 
perihelion. The Sun is 13.5° to the left of the image and the width is 
about 40°. The locus of points at the apex of the contours defines the 
photometric axis of the F-corona. While most observations of the 
F-corona (or zodiacal light) have been taken from 1 astronomical unit 
(AU) away from the Sun, two spacecraft, Helios A and B, each carrying 
the Zodiacal Light Experiment, orbited the Sun from 0.3 to 1.0 AU, one 
observing above the ecliptic plane and the other below. They measured 
the intensity I of the zodiacal light from varying heliocentric distances 
and found that it increases towards the Sun according to $I \propto R_\odot^{-n}$, where 
n = 2.3 ± 0.1 and R☉ is the radius of the Sun. The upper and lower limits 
were recorded at small and large elongations from the Sun, respectively, 
and were independent of the ecliptic longitude of the observer. 
The Sun Earth Connection Coronal and Heliospheric Investigation 
(SECCHI) heliospheric imagers HI-1, onboard the STEREO spacecraft 
orbiting the Sun at approximately 1 AU, observed the corona at elongations 
ranging from 0.07 to 0.45 AU (5°-24°) from the Sun. An analysis of 
intensities of the photometric axis of the F-corona from 2007 to 2014 
found the exponent, for the entire elongation range covered by the 
HI-1 instrument, to be 2.31. Moreover, the analysis performed on 
restricted elongation ranges showed an identical tendency to the 
Helios results for the intensity gradient to increase towards the Sun 
(n = 2.29 ± 0.10). 
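
An exponent of this kind is usually obtained as the slope of a straight-line fit in log-log space. The sketch below shows that procedure on synthetic data only: the amplitude, noise level and sampling are invented for illustration and are not measurements from Helios, STEREO or WISPR, and the function name is hypothetical.

```python
import numpy as np


def fit_power_law(distance, intensity):
    """Fit I = A * distance**(-n) by least squares in log-log space.

    Returns the amplitude A and the exponent n.
    """
    slope, intercept = np.polyfit(np.log10(distance), np.log10(intensity), 1)
    return 10.0 ** intercept, -slope


# Synthetic profile with a known exponent and 1% multiplicative noise.
rng = np.random.default_rng(0)
distance = np.linspace(20.0, 77.0, 50)      # arbitrary distance units
true_exponent = 2.31
intensity = 1e-9 * distance ** (-true_exponent)
intensity *= 1.0 + 0.01 * rng.standard_normal(distance.size)

amplitude, fitted_exponent = fit_power_law(distance, intensity)
print(f"fitted exponent: {fitted_exponent:.2f}")
```

Fitting in log space weights fractional rather than absolute deviations, which suits intensities that span several orders of magnitude.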

Figure 1a displays a log-log plot of a sample of F-coronal intensity 
profiles in units of mean solar brightness (MSB) along its photometric 
axis as measured by WISPR-I between 15° and 50° elongation from 
the centre of the Sun. The sample comprises data from five different 
heliocentric distances of the PSP spacecraft (0.336 AU to 0.166 AU) 
obtained during the orbit inbound to the first perihelion. Colour is 
used to distinguish the plots. For clarity we plot only these five posi- 
tions, but all the profiles during the encounter are similar. These five 
profiles are normalized to the maximum intensity at 30° elongation 
Fig. 1 | Intensity plots along the photometric axis of the F-corona. 
a, Observed intensities from WISPR-I for five heliocentric distances as a 
function of elongation (degrees) scaled to the same value at 30° elongation. 


to reveal the behaviour of the profiles for the various PSP heliocentric 
distances. Clearly, at larger elongations, the curves have exactly the 
same slope, and at shorter elongations (<20°), the intensity decreases 
with decreasing PSP distance, with the top plot (pink) for when PSP is 
the furthest from the Sun and the bottom plot (dark green) the closest. 
Figure 1b shows the intensity profiles at the same five PSP distances 
for both telescopes, but plotted against elongations converted to R☉. 
The conversion to R☉ was performed by dividing the elongation by 
half the angular size of the Sun at the respective PSP distance. We note 
that the curves all overlie each other now, even the decreases seen 
in Fig. 1a. The small upward ticks are due to bright stars. The dashed 
blue line in Fig. 1b shows the linear fit to the F-coronal intensities for 
elongations between 20 R☉ and 77 R☉ (n = 2.31), a result identical to that 
obtained from both earlier observations. For comparison, historical 
data have been added. The dashed green line depicts the linear fit 
to the LASCO-C3 data (light green dots) for elongations greater than 
13 R☉, extrapolated down to 4 R☉. The exponent, n, in this case is also 
The inset in a shows an image of the F-corona taken on 6 November 2018. b, 
Observed intensities from both telescopes for five heliocentric distances as a 
function of elongation (solar radii). See text for further explanation. 


2.31 (note the match between the blue and green dashed lines). The 
LASCO-C3 data were normalized to the WISPR value at 20 R☉. For WISPR 
the absolute calibration was determined by analysing the intensity of 
stars in the field, which resulted in an error of 12%. The relative accuracy 
and repeatability of the WISPR are excellent, which gives us high 
confidence in the turnover of the intensities below 17 R☉. The historical 
measurements represented by the black dots and green dots both 
have absolute errors of 20%. On the other hand, no error was given for 
the data represented by the red dots. 
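
The conversion of elongation to solar radii described above (dividing the elongation by half the angular size of the Sun as seen from the spacecraft) can be sketched as follows. The solar radius in astronomical units is a standard value supplied here as an assumption, and the function name is hypothetical.

```python
import math

R_SUN_AU = 0.00465047  # solar radius expressed in astronomical units


def elongation_in_solar_radii(elongation_deg, observer_distance_au):
    """Express an elongation in units of the Sun's apparent angular radius."""
    angular_radius_deg = math.degrees(math.asin(R_SUN_AU / observer_distance_au))
    return elongation_deg / angular_radius_deg


# The same 20 degree elongation seen from the two extreme distances in the text.
far = elongation_in_solar_radii(20.0, 0.336)
near = elongation_in_solar_radii(20.0, 0.166)
print(f"20 deg at 0.336 AU: {far:.1f} R_sun; at 0.166 AU: {near:.1f} R_sun")
```

A fixed angular elongation corresponds to fewer solar radii as the spacecraft approaches the Sun, which is why profiles that diverge when plotted in degrees can overlie each other when plotted in solar radii.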

Figure 2 shows the K-corona from both telescopes on 6 Novem- 
ber 2018, after removal of the brighter background from the dust 
scattering. Supplementary Videos 1 and 2 provide background-removed 
videos of the images taken during the first two encounters. The grid 
lines for both Fig. 2 and the Supplementary Videos are in the HPLN-ARC, 
HPLT-ARC coordinate system. The videos show the evolution 
of a coronal streamer during the two encounter periods of PSP 
observations. On the large scale, the agreement with model predictions 


[Fig. 2 image: horizontal axis HPLN-ARC (15° to 105°), vertical axis HPLT-ARC (−20° to 10°); frame timestamp 2018-11-06 01:51:32 UT] 


Fig. 2 | Combined images from the inner and outer telescopes of WISPR on 
6 November 2018 at 01:44 UT. After removal of an empirical model of the 
F-corona, the faint solar wind structures are revealed. A faint streamer 
outlining the heliospheric current sheet is visible, as are faint, radial and diffuse 
rays, all with apparent origin on the Sun. The image also reveals the dust trail 
along the orbit of the asteroid 3200 Phaethon (delineated by the white dots). 
The Galaxy dominates the scene in the inner part of the outer telescope 
accompanied by two bright objects: Jupiter (to the upper right) and the star 
Antares (a little below to its left) in the Scorpius constellation. 
Fig. 3 | The propagation of a CME. Shown are five cropped frames from Supplementary Video 1 at different times in the same coordinate system as Fig. 2. The 
radial range is shown at the top, and the latitudinal range is 0° ± 10° for each panel. The yellow and red arrows point to structures described in the text. 


by our team (Extended Data Fig. 1) is very good, validating the model 
assumptions about the configuration of the magnetic field and the 
mass flux of the equatorial solar wind. The representation of WISPR-I 
images in a latitude versus time format (Extended Data Fig. 2a) reveals 
that near perihelion WISPR suddenly imaged faint coronal rays that 
are distinct from the main streamer rays. We note that fine structure 
along the streamer belt has been observed before. High-resolution 
simulations of the corona reproduce these brightness features. 
We attribute their displacement to higher apparent latitudes to the 
spacecraft motion (Extended Data Fig. 2b). This striated 'texture' of 
the background corona is caused in our model by the spatial variability 
of coronal magnetic flux tubes, along which the plasma is heated and 
accelerated to form the slow solar wind. 

Supplementary Video 1 shows a series of ejecta along the streamer. 
A particular event characterized by a big magnetic flux rope followed 
by several smaller ones is shown in Fig. 3. The first one (yellow arrows) 
has an elliptical high-density envelope surrounding a quasi-circular 
density depletion at its centre and a striated envelope. Although similar
structures have been observed by LASCO, they could not always be
resolved. The Encounter 1 images were binned 2 × 2 pixels, giving an
effective 2-pixel spatial resolution of 60 arcsec (at 0.21 AU) for WISPR-I,
which is about 2× finer than the LASCO-C3 observations of this event.

Fig. 4 | Formation and propagation of an island-like structure within a
streamer. The first five panels show snapshots from WISPR-I during the second 
perihelion. An ellipse (green line) is fitted to the high-density ring of the 
structure in each panel. The final panel shows the aspect ratio (minor axis/ 


On the other hand, LASCO-C2 tracked this structure when it was much 
closer to the Sun for a few images only, but with about 2.5× better spatial
resolution. The event was also recorded by WISPR-O with an effective 
2-pixel spatial resolution of 96 arcsec, extending the coverage of the 
event. Such density features have been interpreted as the boundaries 
of magnetic flux ropes”"*. We have combined the spatially resolved 
density information with modelling to locate the structures corre- 
sponding to the internal toroidal and poloidal magnetic fields and study 
their interactions with the ambient plasma as the structure expands 
inside the streamer rays. Preliminary work demonstrates (Extended 
Data Fig. 3) that the structures are indeed consistent with a force-free 
magnetic flux rope propagating along the heliospheric current sheet, 
which is quite flat during this period. The heliospheric current sheet 
flatness may be the reason for detecting the fine-scale structure of the 
event. Such behaviour is extremely rare in 1 AU observations”. 

The event shown in Fig. 3 (on 1 and 2 November 2018) also shows 
two additional smaller flux ropes (red arrows) following the northern 
boundary of the main flux rope; these are probably by-products of 
the interaction between the main event and the ambient corona. The 
quasi-circular shape and faint striations within the feature strongly 
suggest that it is an idealized magnetic flux rope. Smaller features, 
with similar morphologies, are also seen following the main ejection. 
The yellow arrows follow the first, main event. The red arrows follow 
the two following events, until they become merged with the back- 
ground, although they are still visible in Supplementary Video 1. These 
structures were not detected by either LASCO or SECCHI, although 
both have observed many small ejecta’. Small dense features caused 
by interaction of a coronal mass ejection (CME) with its environment 
have proved difficult to identify positively*°. Further studies will be 
necessary to test that idea. 

Observations during the second perihelion (Supplementary
Video 2) again show new dynamics in a coronal streamer. In this case,
the observations capture the formation of oblong structures consistent
with magnetic islands. Magnetic islands are, in two dimensions, a
collection of roughly elliptical magnetic field lines that close on themselves;
or, in three dimensions, helical field lines wrapping around a
central (guide) field, again with a roughly elliptical cross-section. These
island structures are predicted to form via the tearing-mode instability
from magnetic reconnection in a current sheet, such as the one
within this streamer, where oppositely directed magnetic fields meet.
Figure 4 shows several snapshots of this streamer and the formation
and evolution of one of these oblong structures. This structure first

major axis) of these ellipse fits versus time. The red and blue solid lines show a
linear fit to this aspect ratio from 20:00 UT to 23:45 UT on 6 April 2019 (red) and
from 00:00 UT to 02:40 UT on 7 April 2019 (blue). The pairs of dashed lines on
either side of these fits show the 1-sigma values.


appears at the inner edge of the image around 6 April 2019 20:00 UT 
and propagates out, within the current sheet, as an expanding, highly 
elliptical shape with a high-intensity (dense) ring of emission surround- 
ing a low-intensity core. The final panel of Fig. 4 shows a measure of 
the aspect ratio (ratio of minor axis to major axis) of the ellipses fit- 
ted to this structure in each of the 33 frames from time 6 April 2019 
19:57 UT to 7 April 2019 02:54 UT. Each ellipse was fitted to a set of points 
placed by hand on the high-density ring of the oblong structure in each 
frame. The corresponding ellipse is shown as a green curve in the five 
snapshots here, with a red dot at the centre of the ellipse. The plot 
indicates that the structure expands with a slightly increasing aspect 
ratio until 23:45 UT on 6 April 2019 and then it increases more quickly 
until the entire structure fades into the background. This evolution, 
including the increase in aspect ratio, is consistent with simulations of 
the tearing-mode formation of islands in an expanding coronal wind®. 
These simulations show the un-reconnected guide field collecting at 
the centre, forming this low-emission core, with the reconnecting field 
forming the high-density ring around the core. Although such an island 
ejection from a coronal streamer has been reported previously”, the 
earlier observations were not sufficiently resolved to show this internal 
ring and core structure. 
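
The aspect-ratio measurement described above can be illustrated with a minimal sketch. It assumes a plain algebraic least-squares conic fit to the hand-placed points; the fitting method actually used for Fig. 4 is not specified in the text, and the function name `ellipse_aspect_ratio` is hypothetical.

```python
import numpy as np


def ellipse_aspect_ratio(x, y):
    """Return minor/major axis ratio of an ellipse fitted to points (x, y).

    Assumption: an algebraic least-squares fit of the conic
    a*x^2 + b*x*y + c*y^2 + d*x + e*y = 1, after shifting the points so
    that their mean sits at the origin (which keeps the origin inside
    the ellipse and the normalisation well defined).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    x = x - x.mean()
    y = y - y.mean()

    # One row per point; unknowns are the conic coefficients a..e.
    design = np.column_stack([x**2, x * y, y**2, x, y])
    a, b, c, d, e = np.linalg.lstsq(design, np.ones_like(x), rcond=None)[0]

    # The quadratic part sets the axis lengths: semi-axis ~ 1/sqrt(eigenvalue).
    eigvals = np.linalg.eigvalsh(np.array([[a, b / 2.0], [b / 2.0, c]]))
    if eigvals[0] <= 0:
        raise ValueError("fitted conic is not an ellipse")

    # eigvalsh sorts ascending, so eigvals[0] belongs to the major axis.
    return float(np.sqrt(eigvals[0] / eigvals[1]))
```

Applying such a function to each frame gives one aspect ratio per time step; a straight-line fit over a chosen time window (for example with `numpy.polyfit`) would then give the kind of trend shown in the final panel of Fig. 4.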

WISPR imaged a variety of interesting structures in the corona/solar 
wind during the first two PSP orbits about the Sun. The departure from 
linearity of the F-corona intensity profiles below about 17 R☉ is opposite
to that found in both Helios and STEREO data. Although this behaviour 
could be leading to the predicted dust-free zone close to the Sun, the 
intensity decrease could be due to a change in the properties of the 
dust scattering, or a combination of the two. WISPR has certainly not
observed the dust-free zone. Theoretical analyses of the plausible existence
of a dust-free zone predict the formation of circumsolar dust
bands that could be observed by their thermal emission. In a compilation
of the 30 observations made at various wavelengths from 0.8 μm
to 3.6 μm during eleven solar eclipses from 1966 through to 1998, about
half indicated an enhancement and the other half, including the two 
latest eclipses in 1991 and 1998, did not. The resolution of whether this 
WISPR finding represents dust depletion or something else will have 
to wait until PSP steps down to lower perihelia. 

The near-corotation of PSP allows us to observe the radial outflow 
of the solar wind, without the confusing impact of solar rotation. The 
observations suggest that many small ejecta, commonly called ‘blobs’, 
may indeed be magnetic flux ropes but are usually too small to identify 
as such from 1 AU. Structures larger than these are generally
interpreted as CMEs, but the physical mechanism of formation may
not be the same. This finding, particularly with the anticipated meas- 
urements of the same structures by PSP’s in situ payload, may finally 
clarify the evolution of the CME magnetic structure in the heliosphere, 
opening up avenues of research on internal CME dynamics. As PSP steps 
closer to the Sun over the next five years, these observations, together 
with the modelling, will certainly provide insights and opportunities 
to study and separate the spatial and temporal variability of the solar 
wind near its source and will probably increase the performance of 
space weather prediction schemes. This will benefit a wide range of 
communities from basic physics research to space situational awareness
to even astrophysics through exoplanet habitability applications.

Methods


WISPR contains two telescopes and measures the intensity of the vis- 
ible light corona in addition to stellar and galactic sources. The two 
telescopes slightly overlap and have a combined field-of-view of 13.5°
to 108.5° from the Sun, corresponding to approximately 9 to 78 solar
radii (R☉; 1 R☉ = 696,000 km) at 35.6 R☉ perihelion. The visible light
corona consists of two components: light scattered by free electrons
(the K-corona) and light scattered by interplanetary dust (the F-corona).
The F-corona of each WISPR image is removed using a technique
similar to that developed for the SECCHI/HI-1. The primary difference 
is that for the HI-1 images the initial step in the procedure analysed 
the horizontal lines in the image, whereas here, the initial step uses 
the vertical lines in the images. 

All of the data presented here have been calibrated into Mean Solar
Brightness (MSB) units. The calibration details will be published in a
future paper, but include the removal of geometric distortion, vignetting,
instrumental artefacts (stray light, and so on) and then applying
the photometric calibration of the system. The vignetting is caused 
by two sources: the projection of the image onto the two-dimensional 
plane of the Advanced Pixel Sensor detector and for WISPR-I the obscu- 
ration of the objective lens of the sunward side of the image by a series 
of baffles (including the PSP heat shield) which are used to block the 
solar disk illumination and block diffraction from the edges of the 
preceding baffles. The absolute calibration is confirmed on-orbit by 
measuring the intensity of stars passing through the field. The inten- 
sity of the stars as they transit across the image is also a check on the 
vignetting correction. 


Code availability 


The code used in the WISPR pipeline and analysis is available as part 
of the SolarSoft library (https://sohowww.nascom.nasa.gov/data/ 
software.html). 


Data availability 


The PSP Science Data Management Plan (https://sppgway.jhuapl.edu/docs/data/7434-9101 Rev_A.pdf)
requires that all science data from the
first two orbits with calibrations must be released to the public within
six months of downlink of the first orbit. In addition to this data type,
we will be releasing background-subtracted images, videos, and lists
of events. Furthermore, the data must be delivered to the appropriate
NASA/GSFC facility and integrated into the Virtual Observatory.
Thus, the data are available from 12 November 2019. A complete archive
is maintained at NRL (https://wispr.nrl.navy.mil) and will be pub- 
licly available at least during the full mission lifetime. A copy of the 
WISPR data will be located at the NASA/GSFC SDAC facility (https://umbra.nascom.nasa.gov)
and integrated into the Virtual Solar
Observatory.

Extended Data Fig. 1 | Comparison of observations and synthetic
observations from magnetohydrodynamics model. a, An image from the
inner WISPR telescope taken on 3 November 2018 at 06:55:41 UT. The field of
view (of both panels) is 40° × 40° with the Sun 13.5° to the left. Two distinct sets
of bright streamer rays are marked by red arrows. They are separated by a
darker region marked by a blue arrow. The technique employed to remove the
background F-corona in the WISPR image has artificially enhanced this dark
region. The streamer rays located northwards of the dark region (top red
arrow) are brighter than the rays situated southwards of the dark region
(bottom red arrow). b, A synthetic white-light image produced from


three-dimensional simulations of the solar wind by the MULTI-VP 
magnetohydrodynamics code using a Wilcox Solar Observatory photospheric 
magnetogram*. The three-dimensional density cubes produced by running 
the MULTI-VP code were processed by a white-light rendering code computing 
the brightness of the corona in the WISPR field of view from the heliocentric 
position of Parker Solar Probe. The MULTI-VP numerical model and the 
procedure to produce white-light images have been detailed*’. The star field 
from the new Hipparchus astrometric catalogue™ was added to the simulated 
image in b for comparison with the WISPR image in a.

Extended Data Fig. 2 | Latitude versus time maps: observations and
modelling. HEEQ, Heliocentric Earth Equator. a, A representation of WISPR
inner telescope images in the form of a latitude versus time map. This map
provides a summary of the temporal and spatial variability of coronal rays
observed during the first encounter. We note that such fine structure along the
streamer belt has been observed before. We identify in these maps the main
streamer rays already seen in Extended Data Fig. 1 (the same blue and red
arrows are shown here). During the period of super- and corotation (5 to 9
November 2018), bright coronal rays drift in latitude away from the equator
(green arrows). This is also visible in Supplementary Video 2. b, An equivalent
map to a obtained from the WISPR synthetic images based on the MULTI-VP
three-dimensional density cubes shown in Extended Data Fig. 1b. These
medium-resolution simulations reproduce the time-varying aspect of
the main streamer including their fading during perihelion (5 to 7 November).
c, MULTI-VP high-resolution simulation results for the period 5 to 9 November
2018 based on 2-degree resolution magnetograms produced by the Air Force 
Data Assimilative Photospheric Flux (ADAPT) model®. The colour table has 
been saturated in these maps to enhance the features. The solar wind 
simulations reveal the finer striated structure of the corona and the coronal 
rays migrating poleward as observed by WISPR (green arrows). A search in the
simulation data cubes reveals that these faint rays are separate from the 
brighter streamer rays. They form in the simulation as a result of considerable 
variability in the properties of the magnetic fields along which the slow solar 
wind forms. Since the prescribed coronal heating is scaled to the magnetic field 
properties this drives different mass flux along different flux tubes. We 
interpret the coronal rays marked by the top red arrows as resulting from the
main streamer and the rays situated southwards (bottom red arrow) as
resulting from a pseudo-streamer.

Extended Data Fig. 3 | Modelling of a CME as a 3D flux rope. a, An image from
the inner WISPR telescope taken on 1 November 2018 at 19:30:50 UT during the
passage of a pristine CME. Clear substructures are discernible in the WISPR
image. The field of view is 40° × 40° with the Sun 13.5° to the left. A bright ring
at the outer contour/boundary of the CME is indicated by a blue arrow. A
striking feature of this CME event is the presence of a dark circular core located
at the centre of the CME event and indicated by a red arrow. b, The same image
as in a but with the results of a three-dimensional flux rope fit superimposed.
This figure proposes an interpretation for the different features observed by
WISPR based on our current understanding of the appearance of CMEs imaged
in white light. The magnetic field lines (computed from solutions of the Grad-Shafranov
equation) of the CME are traced inside this flux rope. The bright ring
(blue arrow) corresponds to plasma located on the boundary of the flux rope
where the poloidal magnetic field lines of the CME are adjacent to the ambient
solar wind plasma. The dark core (red arrow) marks the location where strong
toroidal (axial) magnetic fields dominate the plasma locally. Detailed
modelling of the event will be presented in a future dedicated publication. We
acknowledge the use of the Wilcox Solar Magnetograms used in this paper, 
obtained from the website at http://wso.stanford.edu.

During the solar minimum, when the Sun is at its least active, the solar wind is
observed at high latitudes as a predominantly fast (more than 500 kilometres
per second), highly Alfvénic rarefied stream of plasma originating from deep within
coronal holes. Closer to the ecliptic plane, the solar wind is interspersed with a more
variable slow wind of less than 500 kilometres per second. The precise origins of the
slow wind streams are less certain; theories and observations suggest that they may
originate at the tips of helmet streamers, from interchange reconnection near coronal
hole boundaries, or within coronal holes with highly diverging magnetic fields. The
heating mechanism required to drive the solar wind is also unresolved, although
candidate mechanisms include Alfvén-wave turbulence, heating by reconnection in
nanoflares, ion cyclotron wave heating and acceleration by thermal gradients. At a
distance of one astronomical unit, the wind is mixed and evolved, and therefore much of
the diagnostic structure of these sources and processes has been lost. Here we present
observations from the Parker Solar Probe at 36 to 54 solar radii that show evidence of
slow Alfvénic solar wind emerging from a small equatorial coronal hole. The measured
magnetic field exhibits patches of large, intermittent reversals that are associated with
jets of plasma and enhanced Poynting flux and that are interspersed in a smoother and
less turbulent flow with a near-radial magnetic field. Furthermore, plasma-wave
measurements suggest the existence of electron and ion velocity-space micro-instabilities
that are associated with plasma heating and thermalization processes.
Our measurements suggest that there is an impulsive mechanism associated with solar-wind
energization and that micro-instabilities play a part in heating, and we provide
evidence that low-latitude coronal holes are a key source of the slow solar wind.


The first solar encounter of the Parker Solar Probe (PSP) occurred during
the solar minimum. The spacecraft orbit remained within 5° of the
heliographic solar equator and, unlike any previous spacecraft, was
corotational with the Sun for two intervals surrounding perihelion.
Figure 1 summarizes the radial magnetic field (B_R; in heliocentric RTN
coordinates; see Methods) structure observed by the FIELDS experiment
for a six-week time interval centred on perihelion (6 November
2018). Figure 1a shows 1-s cadence measurements of B_R (see Methods)
which show the overall 1/r² behaviour expected from simple flux-conservation
arguments as the heliocentric distance of PSP varied along
its eccentric orbit. Against this background, dramatic and unexpected
rapid polarity reversals where δB_R/|B| is of the order of 1 are superposed
(|B| is the magnitude of the magnetic field). One-hour statistical modes
(most probable value; see Methods) of B_R in Fig. 1b remove the transient
polarity inversions and reveal the large-scale magnetic structure.
Time series predictions of B_R generated from the simple, but widely
used, Potential Field Source Surface (PFSS) model are shown for
comparison in black and green. The implementation of this model
and the procedure to connect it to the location of PSP and generate
the time series is discussed in the Methods section ‘PFSS modelling
and connection to PSP’.
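
As an illustration of the two data reductions mentioned in this paragraph, the sketch below removes the 1/r² trend expected from flux conservation and takes a windowed "most probable value". It is a minimal example under stated assumptions: the mode is approximated here by the centre of the fullest histogram bin, the bin count is arbitrary, and the function names are hypothetical; the procedure actually used is the one described in the Methods.

```python
import numpy as np


def scale_to_reference(b_r, r, r_ref=1.0):
    """Remove the 1/r^2 trend by scaling B_R to a reference distance r_ref."""
    b_r = np.asarray(b_r, dtype=float)
    r = np.asarray(r, dtype=float)
    return b_r * (r / r_ref) ** 2


def windowed_mode(values, samples_per_window, bins=20):
    """Most probable value in consecutive, non-overlapping windows.

    Assumption: the mode of each window is estimated as the centre of the
    histogram bin holding the most samples. Trailing samples that do not
    fill a whole window are ignored.
    """
    values = np.asarray(values, dtype=float)
    n_windows = len(values) // samples_per_window
    modes = np.empty(n_windows)
    for i in range(n_windows):
        chunk = values[i * samples_per_window:(i + 1) * samples_per_window]
        counts, edges = np.histogram(chunk, bins=bins)
        k = int(np.argmax(counts))
        modes[i] = 0.5 * (edges[k] + edges[k + 1])
    return modes
```

With 1-s cadence data, `samples_per_window=3600` would correspond to one-hour windows. Unlike a mean, the mode is insensitive to a minority of samples with reversed polarity, which is why it suppresses the transient reversals while keeping the large-scale field.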


Magnetic field structure 


PFSS is a zero-current force-free model of the global solar corona, 
meaning that it assumes that magnetic pressure dominates over gas 


The list of affiliations appears at the end of the paper.

Fig. 1 | Radial magnetic field measurements are highly structured, map back
to the Sun, and are consistent with a low source surface. a, The measured
radial magnetic field B_R is comprised of a large-scale field, which scales
approximately as 1/r² (red dotted lines; also in Fig. 1b) and rapid large-amplitude
polarity reversals (δB_R/|B| of the order of 1) associated with jets of
plasma (Fig. 2b). b, One-hour statistical modes of B_R (bisymmetric logarithmic
plot) show the large-scale radial field coloured for polarity (red, outward; blue,
inward). Predicted radial-field profiles from a PFSS model are over-plotted
using a source-surface radius of R_SS = 1.2 R☉ (black curve, unscaled) and 2.0 R☉
(green curve, multiplied by a factor of 6.5). R_SS at 1.2 R☉ reproduces many of the
measured polarity changes (labelled A, C, F and G). The R_SS = 2.0 R☉ model better
predicts the timing of polarity inversion G (see Methods). Labels B and E
indicate transient events, and the perihelion coronal hole interval is centred on
D. Corotations (CR1 and CR2; green) and the perihelion (PH; red dot) at 35.7 R☉
are labelled. c, An extreme-ultraviolet synoptic map of 171-Å (Fe IX) emission


pressure (that is, it assumes a low plasma β value), so that the problem
reduces to magnetostatics, giving a solution of a static field configuration
that rigidly corotates with the Sun. The role of gas dynamics is
approximated by requiring that the tangential magnetic field vanishes
at a spherical ‘source surface’ at some radius R_SS, which simulates how
the outflowing solar wind drags the field lines out into the heliosphere.

[Fig. 1 legend: symbols mark outward (inward) polarity at the spacecraft.]


shows structure associated with active regions (small-scale extreme-ultraviolet
bright points appear as red patches) and lower-density plasma in
coronal holes (darker regions). The PSP trajectory at the source surface is
superimposed, coloured as in b for measured field polarity. Encounter 1 begins
at the orange diamond, moves westwards (in decreasing Carrington longitude,
with respect to a fixed point on the solar surface) across the map through
perihelion at about 330°, and ends at the yellow diamond. A line shows the
location of the model polarity-inversion line (PIL) at the source surface
(R_SS = 1.2 R☉, black; R_SS = 2.0 R☉, green). Red and blue squares indicate the
polarity on either side of the PIL models. Red (B_R > 0) and blue (B_R < 0) lines map
the magnetic field from R_SS back to the photosphere for R_SS = 2.0 R☉; for
R_SS = 1.2 R☉, the model field lines are radial. d, The extreme-ultraviolet map of the
perihelion interval, showing field lines mapping back to the Sun into a small
equatorial coronal hole, and the location of the adjacent PIL associated with the
heliospheric current sheet, from the 2.0 R☉ model.


The magnetostatic approximation limits the accuracy and applicability 
of the model. Nevertheless, PFSS is widely used as a computationally 
tractable first approximation and forms the basis for more sophisticated
models. We note that PSP encounter 1 took place very close
to solar minimum, with low solar activity, reducing the impact of non-potential
transient events and active regions.

Fig. 2 | Magnetic field reversals and plasma jets carry a Poynting flux. a, Time
series measurements of the magnetic field magnitude |B| (black line) and total
plasma density n_e (blue line) show anti-correlation during jet events, consistent
with magnetohydrodynamic slow-mode behaviour. b, Radial Poynting flux
(S_R, black line) and ion kinetic energy flux (blue line), showing large
enhancements during jet-field reversal events. c, Tangential components of
the magnetic field (B_T, black) and plasma velocity (v_T, green)


In Fig. 1b, two model evaluations are shown with R_SS = 2.0 R☉ (R☉, solar
radius; green line) and R_SS = 1.2 R☉ (black line). In both cases R_SS is well
below the established value. However, this value is necessary to provide
good agreement for all model inputs (see Methods) and is not
without precedent. Model comparison reveals an overall very good
agreement for both models, but also shows that the polarity inversions
at features A and C are washed out except with the lower source-surface
radius (black line). Meanwhile, the timing of feature G is better captured
with the higher source-surface radius (green line), illustrating the difficulty
that PFSS has with assuming a single source-surface radius, and
supporting previous findings of a varying ‘true’ source-surface radius.
Finally, Fig. 1c, d depicts field-line mappings derived from the same 
PFSS models shown in Fig. 1b to connect the spacecraft down to the 
lower corona to establish context for the in situ measurements. The 
spacecraft trajectory is shown projected onto the source surface, col- 
oured by its measured polarity. 

The background is a synoptic map of extreme-ultraviolet emission
in the 171-Å wavelength for which dark regions imply lower-density
plasma and the likely location of open magnetic field lines. For reference,
this background is shown in isolation in Extended Data Fig. 4
along with its corresponding map for the 193-Å wavelength. The neutral
lines derived from the PFSS models are shown as single contours in 
the same colour as their time series in Fig. 1b. Figure 1c shows how the 
neutral line topology explains the polarity inversions measured by 
PSP. Figure 1d is a magnification of the two-week interval closest to 
perihelion (330° longitude). During the entire two-week corotation 
loop period, PSP remained connected to a small, negative-polarity, 
isolated equatorial coronal hole, suggesting that the rapid magnetic 
field polarity reversals seen in Fig. 1a are magnetic structures emerging


showing Alfvénic fluctuations. d, The normal components of the magnetic
field (B_N, black line) and the plasma velocity (v_N, green line). e, Radial magnetic
field (B_R, black line) and plasma velocity (v_R, green line), showing an interval of
quiet radial-field wind and flow adjacent to a magnetic structure associated
with jets of plasma. Measurements are made from around 00:00 to 03:00
universal time (UT) on 5 November 2018 at about 36.6 R☉. The Alfvén speed during
the quiet interval is v_A ~ 100 km s⁻¹.


from this coronal hole and sweeping past the PSP spacecraft. Extended 
Data Fig. 5 shows the configuration schematically. For most of this 
interval, SWEAP measurements of the solar-wind velocity indicated
an Alfvénic slow wind stream (see Fig. 2), suggesting a slow solar wind 
source rooted in an equatorial coronal hole at the Sun. Polarity inver- 
sions B and E in Fig. 1b are associated with (transient) flux rope and 
coronal mass ejection”’ events, respectively. 


Alfvénic fluctuations and plasma jets

Time series magnetic field and velocity structures show the correlations
(Fig. 2c-e) expected of propagating Alfvén waves, especially during
the quiet radial-field intervals. The δB_R polarity reversal intervals show
enhanced radial wind velocity v_R (Fig. 2e) and the Alfvénic correlations
of velocity and magnetic field (δv to δB) within the polarity inversions
and jets suggest that these structures may be interpreted as large-amplitude,
three-dimensional Alfvénic structures convected away
from the Sun. As a simple measure, statistics of zero-crossings (polarity
reversals; see Methods) show that around 6% of the temporal duration
of encounter 1 is comprised of these so-defined jets. Many jet intervals
show signatures of compressibility (Fig. 2a), in this case anti-correlated
plasma density n_e and |B|, suggesting slow-mode or pressure-balanced
behaviour. Isolated Alfvénic features associated with magnetic field
reversals have been identified at 60 R☉, near one astronomical
unit (AU) and in the polar heliosphere by the spacecraft Ulysses;
however, at those distances few or no compressive signatures were
present. It has been suggested that these magnetic structures could be
signatures of impulsive reconnection events in the Sun’s atmosphere;
simulations show qualitative similarities to the events of encounter 1
but do not reproduce the observed magnetic field reversals past 90°.

Fig. 3 | Plasma-wave activity near the perihelion differs in quiet wind and
jets. a, Spectral density measurements of electric field fluctuations near the
electron plasma frequency f_pe show intense bursts of electrostatic Langmuir
waves with intensities around 107-10* V7 Hz ‘above the thermal background, 
suggesting the presence of electron beams. b, Electrostatic waves near the
electron cyclotron frequency f_ce (white dotted line) and its harmonics are often
present in intervals of ambient radial magnetic field, but not present in jet
plasma. c, A wavelet spectrogram of the magnetic field shows bursts of
turbulent fluctuations with a distinct spectral break between 1 and 10 Hz
associated with a transition to dissipation scales. The trace magnetic


Alfvénic structures and waves have long been considered to be an 
important energy source for the solar wind. The radial Poynting
flux S_R = (E × B)_R/μ₀ (where μ₀ is the vacuum permeability and E is the
electric field; see Methods) in the spacecraft frame (Fig. 2b) is about
10% of the kinetic energy flux (blue curve) and shows enhancements
during the jet intervals, suggesting that these plasma jets may impart
energy to the emerging solar wind. As seen in Figs. 1a, 2e, the plasma
jets appear to be clustered and interspersed in an otherwise quiet solar 
wind flow with prominently radial magnetic field. 
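
The Poynting-flux definition above translates directly into a few lines of code. The sketch below is illustrative only: it assumes the electric and magnetic field vectors are already expressed in RTN components and SI units, and the function name `radial_poynting_flux` is hypothetical.

```python
import numpy as np

MU0 = 4e-7 * np.pi  # vacuum permeability in H/m


def radial_poynting_flux(e_rtn, b_rtn):
    """Radial Poynting flux S_R = (E x B)_R / mu0 in W/m^2.

    Assumptions: e_rtn is the electric field in V/m and b_rtn the magnetic
    field in T, both as (..., 3) arrays ordered (R, T, N) in a right-handed
    RTN frame, so that the first component of the cross product is radial.
    """
    e_rtn = np.asarray(e_rtn, dtype=float)
    b_rtn = np.asarray(b_rtn, dtype=float)
    s = np.cross(e_rtn, b_rtn) / MU0
    return s[..., 0]
```

For a time series, passing arrays of shape (n, 3) returns one S_R value per sample, which can then be compared with the ion kinetic energy flux as in Fig. 2b.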


Micro-instabilities and turbulence 

The quiet radial-flow intervals contain plasma waves consistent with
expectations of micro-instabilities associated with ion and electron
velocity-space structure (Fig. 3). The electric field spectrum from about
11 kHz to about 1,688 kHz shows signatures of plasma quasi-thermal
noise (Fig. 3a) at the electron plasma frequency f_pe (which is used
to estimate the total plasma density in Fig. 2a). Intense bursts of narrowband
electrostatic Langmuir waves (Fig. 3a) occur throughout
the perihelion encounter; narrowband Langmuir waves are driven 
by electron beams and damp rapidly, suggesting the presence of an 
intermittent, local population of electron beams. 
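
The link between the electron plasma frequency and the plasma density used above follows from the standard cold-plasma relation f_pe = (1/2π)·sqrt(n_e e² / (ε₀ m_e)). The sketch below inverts that relation; it is a textbook conversion offered for illustration, not the quasi-thermal-noise fitting procedure itself, and the function name is hypothetical.

```python
import math

EPS0 = 8.8541878128e-12       # vacuum permittivity, F/m
M_E = 9.1093837015e-31        # electron mass, kg
Q_E = 1.602176634e-19         # elementary charge, C


def density_from_plasma_frequency(f_pe_hz):
    """Electron density in cm^-3 from the electron plasma frequency in Hz.

    Inverts f_pe = (1 / 2 pi) * sqrt(n_e e^2 / (eps0 m_e)).
    """
    omega = 2.0 * math.pi * f_pe_hz
    n_per_m3 = EPS0 * M_E * omega**2 / Q_E**2
    return n_per_m3 * 1e-6  # convert m^-3 to cm^-3
```

As a rough check, a plasma line near 90 kHz corresponds to a density of about 100 cm⁻³, consistent with the familiar approximation f_pe ≈ 9 kHz × sqrt(n_e [cm⁻³]).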

The electric field spectrum (Fig. 3b) from 0.3 kHz to about 75 kHz 
shows intermittent bursts of electrostatic whistler-wave activity, 
peaked in power below the electron gyrofrequency f_ce. Also present
are waves containing harmonic structure consistent with electron-Bernstein-wave
emission. Electrostatic whistler-Bernstein bursts

field intensity is divided by a factor proportional to f^(−5/3) such that a power spectrum
with index −5/3 has no frequency dependence. d, Magnetic helicity (σ_m, from
the wavelet spectrogram) shows narrowband f,,</</,; + Up/U, (the expected 
Doppler-shifted frequency, where y, is the local Alfvén speed; dashed lines) 
signatures associated with ion cyclotron waves, when σ_m is close to the
maximum (+1) or the minimum (−1), again in quiet radial solar wind. e, The
normalized radial magnetic field B_R/|B| shows distinct intervals of quiet radial-field
wind, reduced turbulent levels and increased occurrence of electrostatic
whistler and ion cyclotron instability. Measurements in Fig. 3 were made at
approximately 00:00-03:00 UT on 5 November 2018 at about 36.6 R☉.


are generated by features in the electron velocity distribution function 
f_e(v) and are not observed in the solar wind at 1 AU. Here they occur 
only in the quiet radial-field intervals. A wavelet spectrogram (divided 
by a function P_K ∝ f^(−5/3)) of the search coil magnetometer and fluxgate 
magnetometer data in Fig. 3c shows the spectral content of the 
magnetic field to around 146 Hz. A spectral break between 1 and 10 Hz (in the 
spacecraft frame) is highly variable and associated with the transition 
from a magnetohydrodynamic turbulent cascade to dissipation or 
dispersion ranges at ion kinetic length scales. Note that the overall 
turbulent levels are lower and more intermittent in the quiet radial 
wind (Figs. 3c, 4a). The spectrum of magnetic helicity σ_m in Fig. 3d 
indicates intervals of large (1 > σ_m > 0.5, red; −0.5 > σ_m > −1, blue) circular 
polarization often associated with ion cyclotron waves. These ion 
wave events are apparent during quiet radial-field intervals. 

The (trace) magnetic field spectra (see Methods), averaged over 
30 min (Fig. 4a), show broken power-law behaviour, with spectral 
indices roughly comparable to the −5/3 and −8/3 predictions for 
magnetohydrodynamic and kinetic-scale turbulence, respectively. 
This suggests that by 36.6R☉, the solar wind has already developed a 
turbulent cascade, to transport energy from large-scale motions to 
the microscale, where it can be dissipated. In the radial quiet wind 
(blue trace), where the turbulence level is substantially lower, an 
enhancement of wave power near the ion cyclotron frequency is 
observed. In the active jet wind (black trace), a steep spectrum is seen at 
the plasma ion inertial and gyroscales, indicating a transition to kinetic 
range turbulence and possibly the dissipation of turbulent energy to 
Fig. 4 | Power spectral density and magnetic compressibility of magnetic 
field fluctuations in quiet and jet wind. a, Thirty-minute integrated power 
spectra of fluctuations in quiet (blue) and jet (black) solar wind conditions 
show the transition from a magnetohydrodynamic inertial range to 
dissipation- or dispersion-range turbulence, here compared to the 
spacecraft-frame frequency f^(−5/3) and f^(−8/3) power laws. The quiet wind spectrum (blue) 
shows increased power near the ion cyclotron frequency (f_ci; dashed vertical 
line) that is associated with enhanced magnetic helicity (Fig. 3e). b, The ratio of 
magnitude (|B|) to tr(B) spectra indicates increased magnetic compressibility 
during jet intervals (black) compared to quiet wind (blue) up to the dissipation 
scale (a slope of −8/3). The ion cyclotron band corresponds to lower 
compressibility, as expected. 


heat the solar wind as it expands to fill the heliosphere. In both types 
of wind, the power levels are several orders of magnitude larger than 
at 1 AU. The magnetic compressibility, defined as C_|B| = (δ|B|/|δB|)², 
shows an increase at high frequencies, as expected for kinetic range 
turbulence (Fig. 4b). At low frequencies, the compressibility is larger 
in jet wind than in quiet wind, but remains small (C_|B| less than about 
0.1), indicating that jet fluctuations have an enhanced compressible 
component but are still predominantly Alfvénic. In the quiet wind, the 
band of enhanced power near the cyclotron frequency has a reduced 
magnetic compressibility, as expected for quasi-parallel ion cyclotron 
waves. 
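
As an illustration of the compressibility defined above, the following minimal sketch estimates C_|B| as the ratio of the power spectrum of |B| to the trace power spectrum of the three field components. It is not the analysis code used for Fig. 4: the periodogram estimator, the function names and the synthetic test fields are assumptions made for the example.

```python
import numpy as np


def power_spectrum(x, dt):
    """One-sided periodogram of a real series (mean removed, zero frequency dropped)."""
    x = np.asarray(x, dtype=float)
    spec = np.fft.rfft(x - x.mean())[1:]
    return 2.0 * dt * np.abs(spec) ** 2 / x.size


def magnetic_compressibility(b_rtn, dt):
    """Return (frequency, C_|B|), with C_|B| = PSD(|B|) / trace PSD(B).

    b_rtn has shape (n_samples, 3); dt is the sampling interval in seconds.
    """
    b_rtn = np.asarray(b_rtn, dtype=float)
    freq = np.fft.rfftfreq(b_rtn.shape[0], d=dt)[1:]
    p_mag = power_spectrum(np.linalg.norm(b_rtn, axis=1), dt)
    p_trace = sum(power_spectrum(b_rtn[:, k], dt) for k in range(3))
    return freq, p_mag / p_trace


# Synthetic example: a hypothetical -70 nT radial field with 1 nT noise.
rng = np.random.default_rng(0)
noise = rng.normal(0.0, 1.0, size=(4096, 3))
background = np.array([-70.0, 0.0, 0.0])
transverse_only = background + noise * np.array([0.0, 1.0, 1.0])  # Alfvénic-like
isotropic = background + noise                                     # includes δ|B|

_, c_transverse = magnetic_compressibility(transverse_only, dt=1.0)
_, c_isotropic = magnetic_compressibility(isotropic, dt=1.0)
print(c_transverse.mean(), c_isotropic.mean())
```

Fluctuations perpendicular to the mean field barely change |B|, so the first case gives a compressibility far below 0.1, whereas isotropic fluctuations place roughly one third of the power in the magnitude.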

PSP encounter 1 reveals a more structured and dynamic solar wind 
than is seen at 1 AU, with impulsive magnetic field reversals and plasma 
jets embedded in a quiet radial wind emerging from a small equatorial 
coronal hole. As PSP goes to lower altitudes, eventually to 9.8R☉ 
during the upcoming solar maximum, we expect to descend below the 
altitude where the wind becomes super-Alfvénic (v_R > v_A) and 
measure the interface between the corona and the solar wind for the 
first time. 
Methods 


Heliocentric RTN coordinates 

We use so-called heliocentric RTN coordinates in our study. R points 
from the Sun centre to the spacecraft. T lies in the spacecraft plane 
(close to the ecliptic) and is defined as the cross product of the solar 
rotation axis with R. T points in the direction of prograde rotation. N 
completes a right-handed system. 
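
The RTN construction described above can be sketched in a few lines. This is only an illustrative helper, assuming the spacecraft position and the solar rotation axis are supplied in the same inertial Cartesian frame; the function names are hypothetical.

```python
import numpy as np


def rtn_basis(r_sc, rotation_axis):
    """Unit vectors (R, T, N) for a spacecraft at position r_sc.

    R points from the Sun centre to the spacecraft, T is the normalized
    cross product of the solar rotation axis with R, and N = R x T
    completes the right-handed system.
    """
    r_sc = np.asarray(r_sc, dtype=float)
    rotation_axis = np.asarray(rotation_axis, dtype=float)
    r_hat = r_sc / np.linalg.norm(r_sc)
    t_vec = np.cross(rotation_axis, r_hat)
    t_hat = t_vec / np.linalg.norm(t_vec)
    n_hat = np.cross(r_hat, t_hat)
    return r_hat, t_hat, n_hat


def to_rtn(vector, r_sc, rotation_axis):
    """Project a Cartesian vector (for example B) onto the local RTN basis."""
    basis = np.vstack(rtn_basis(r_sc, rotation_axis))
    return basis @ np.asarray(vector, dtype=float)


# Spacecraft on the x axis, rotation axis along z: T is +y (prograde), N is +z.
print(rtn_basis([2.0, 0.0, 0.0], [0.0, 0.0, 1.0]))
```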


Statistical modes 

To examine the large-scale magnetic structure (Fig. 1b), we seek to 
remove the rapidly varying spikes observed in Fig. 1a. To do this we 
produce statistical modes which are defined by binning the full cadence 
magnetic field observations into one-hour intervals and for each inter- 
val, calculating the modal value—the peak of the histogram of field 
values within each interval. 
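
A minimal sketch of the statistical-mode procedure is given below. It assumes a time array in seconds and uses a 1 nT histogram resolution, the value quoted in the measurement details; the function name and the synthetic data are illustrative only.

```python
import numpy as np


def statistical_mode(t_sec, values, window=3600.0, resolution=1.0):
    """Modal value of `values` in consecutive windows of length `window` seconds.

    Each window is histogrammed with bins of width `resolution`, and the
    centre of the most populated bin is returned as the mode.
    """
    t_sec = np.asarray(t_sec, dtype=float)
    values = np.asarray(values, dtype=float)
    window_index = np.floor((t_sec - t_sec[0]) / window).astype(int)
    centres, modes = [], []
    for k in np.unique(window_index):
        sample = values[window_index == k]
        lo = np.floor(sample.min() / resolution) * resolution
        edges = np.arange(lo, sample.max() + 2.0 * resolution, resolution)
        counts, edges = np.histogram(sample, bins=edges)
        peak = np.argmax(counts)
        modes.append(0.5 * (edges[peak] + edges[peak + 1]))
        centres.append(t_sec[0] + (k + 0.5) * window)
    return np.array(centres), np.array(modes)


# Two hours of 1-s data: a steady -70.3 nT field with a positive spike every 10 s.
t = np.arange(7200.0)
b_r = np.full(t.size, -70.3)
b_r[::10] = 50.2
print(statistical_mode(t, b_r)[1], b_r.mean())
```

In this synthetic case the spikes pull the mean well away from the background level, while the mode stays in the histogram bin that contains the quiet-field value.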


Identification of jet intervals 

We calculate that approximately 6% of the duration of encounter 1 
consists of jet intervals. That number is computed by measuring the 
duration of positive polarity B_R intervals (58,973 s) occurring from 30 
October 2018 to 11 November 2018 UT (1,036,800 s total). This interval 
was chosen to correspond to the negative polarity interval centred on 
D (Fig. 1b), which has primarily negative polarity over the coronal hole, 
and does not have transient coronal mass ejection events. The 
positive-polarity jets were identified using a simple zero-crossing algorithm 
applied to 1-s cadence radial magnetic field data B_R. Of course, not all 
so-called jets contain full polarity reversals. Biasing this calculation 
with an amplitude offset will produce a larger fraction of jet times; 
this is an ongoing study. 
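
The zero-crossing estimate can be sketched as follows. This is a simplified stand-in for the algorithm described above, not the code used in the study: it assumes evenly sampled B_R data, and the `offset` argument is a hypothetical way of expressing the amplitude bias mentioned in the text.

```python
import numpy as np


def reversal_statistics(b_r, background_sign=-1.0, offset=0.0):
    """Fraction of samples with reversed polarity, and number of reversed intervals.

    For a negative-polarity background (background_sign = -1) a sample counts
    as reversed when b_r > -offset; offset = 0 is a pure zero-crossing test.
    """
    b_r = np.asarray(b_r, dtype=float)
    reversed_mask = (-background_sign * b_r) > -offset
    # Rising edges of the mask mark the start of each reversed interval.
    padded = np.concatenate(([0], reversed_mask.astype(int), [0]))
    n_intervals = int(np.count_nonzero(np.diff(padded) == 1))
    return float(reversed_mask.mean()), n_intervals


sample = [-5.0, -5.0, 3.0, 4.0, -5.0, 2.0, -5.0, -5.0, -5.0, -5.0]
print(reversal_statistics(sample))               # pure zero-crossing test
print(reversal_statistics(sample, offset=6.0))   # biased threshold, larger fraction

# The fraction quoted in the text follows from the two durations:
print(58_973 / 1_036_800)
```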


PSP/FIELDS measurement details 

Measurements presented here were made by the FIELDS and SWEAP 
instruments on the PSP spacecraft. Magnetic field measurements in 
Fig. 1a were made by the FIELDS fluxgate magnetometer and are 
averaged to 1-s cadence from their native cadence, which varies from about 
2.3 to 293 samples per second over encounter 1. The B_R data shown in 
Fig. 1b are derived from the 1-s data by computing the distribution of 
amplitudes in one-hour intervals with an amplitude resolution of 1 nT, 
and by finding the peak value of that distribution: that is, the statistical 
mode. This technique removes the fluctuating ‘jet’ intervals, without 
introducing the amplitude bias of an averaging algorithm. 

The magnetic field measurements in Fig. 2 start at 1-s cadence, 
averaged down from their native cadence as described above. All magnetic 
field measurements here are calibrated accurate to better than 0.5 nT. 
SWEAP velocity measurements are made by the Solar Probe Cup (SPC) 
sensor at a cadence of about 1 measurement per 0.87 s and then 
averaged to 5-s intervals. The 1-s cadence magnetic field data are then 
averaged onto these 5-s time intervals. This reduces fluctuation noise in 
the SPC data and provides velocity and magnetic field measurements 
at the same cadence. The plasma density measurements in Fig. 1a are 
made using the FIELDS Low Frequency Receiver (LFR), which measures 
the fluctuating electric field across the V1-V2 antenna pair and 
computes the spectral density (also shown in Fig. 3a). The spectral peak is 
identified and associated with the electron plasma frequency f_pe, as 
described previously. Hence, the frequency of the peak amplitude 
gives a reliable estimate of the total plasma density. The spectral 
resolution of the LFR instrument is Δf/f = 4%. The plasma frequency f_pe is 
proportional to √n_e, where n_e is the electron (total) density; therefore 
the resulting uncertainty in the density measurement is 
Δn/n = 2Δf/f = 8%. Electric field measurements used to compute the 
radial Poynting flux in Fig. 2b are measured directly as differential 
voltage pairs between the V1-V2 and V3-V4 antennas and are then 
calibrated to electric field units by comparison to −v × B as computed from 
the SPC velocity and fluxgate magnetometer data. This enables us to 
remove spacecraft offset electric fields and compute an effective 
probe-separation length, a standard technique used to calibrate electric field 
instrumentation. The electric field measurement is accurate to 
approximately 1 mV m⁻¹. 
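
The conversion from the plasma-frequency peak to density, and the propagation of the spectral resolution into a density uncertainty, can be written out as a short worked example. It uses the standard cold-plasma relation f_pe = (1/2π)√(n_e e²/ε0 m_e); the function names are illustrative and not part of any FIELDS software.

```python
import math

EPSILON_0 = 8.8541878128e-12        # vacuum permittivity, F/m
ELECTRON_MASS = 9.1093837015e-31    # kg
ELEMENTARY_CHARGE = 1.602176634e-19  # C


def density_from_plasma_frequency(f_pe_hz):
    """Electron density in cm^-3 from the electron plasma frequency in Hz."""
    omega_pe = 2.0 * math.pi * f_pe_hz
    n_m3 = EPSILON_0 * ELECTRON_MASS * omega_pe ** 2 / ELEMENTARY_CHARGE ** 2
    return n_m3 * 1.0e-6


def density_relative_uncertainty(df_over_f):
    """Since n_e is proportional to f_pe squared, dn/n = 2 df/f to first order."""
    return 2.0 * df_over_f


print(density_from_plasma_frequency(100.0e3))   # a 100 kHz peak, in cm^-3
print(density_relative_uncertainty(0.04))       # 4% spectral resolution
```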

Measurements in Fig. 3a show the full spectrum of the RFS/LFR 
receiver in spectrogram form, as measured by the V1-V2 antenna pair. 
Wave intensity in Fig. 3a ranges from about 6 x10” to1.4 x 10° V7Hz™ 
and is represented logarithmically. The spectral bandwidth of the LFR 
receiver is Δf/f = 4.5% and the cadence of the measurement is 1 
spectrum per 7 s. Figure 3b shows the electric field spectrogram of 
differential voltage measurements on the V1-V2 antenna pair from the 
Digital Fields Board (DFB) subsystem, with intensity in arbitrary 
logarithmic amplitude units. The spectral resolution of this channel 
of the DFB is Δf/f ≈ 6-12% and the measurement cadence is 1 spectrum 
per 5.5 s. Figure 3c shows the magnetic field spectrogram of search 
coil magnetometer measurements from DFB, with intensity in 
arbitrary logarithmic amplitude units. The wavelet spectrogram in Fig. 3d 
and magnetic helicity spectrum in Fig. 3e were computed using the 
wav_data routine for the IDL programming language in the SPEDAS 
suite of IDL analysis routines. Wave intensity in Fig. 3c is represented in 
logarithmic power in arbitrary units and is divided by a factor P_K ∝ f^(−5/3) 
(flattened), such that a power spectrum with spectral index −5/3 has 
no frequency dependence. 


PFSS modelling and connection to PSP 

Modelling the magnetic field time series (Fig. 1b) and tracing field lines 
from PSP down into the corona (Fig. 1c, d) was performed in two main 
steps. 


(1) PFSS implementation. PFSS modelling used the recent 
open-source Python implementation pfsspy. This code package is freely 
available online, extremely flexible with regard to changing the input 
parameters and efficient (a full PFSS solution can be extracted in about 
14 s including downloading the magnetogram on demand). Given a 
magnetogram and source-surface radius (R_SS) as boundary conditions, the 
code solves the Laplace equation (equation (1)) for the magnetic scalar 
potential, Φ_B, and outputs a full three-dimensional magnetic field within 
the annular volume bounded by the photosphere and the source-surface 
parameter. The choice of magnetogram data and values of source-surface 
height depicted in Fig. 1 are discussed in Methods section ‘Choice of 
magnetogram data and source-surface radius for Fig. 1’ below. 


$$\nabla^2 \Phi_B(\mathbf{r}) = 0 \qquad (1)$$


(2) Ballistic propagation. The procedure to magnetically connect 
PSP to a particular location at the outer boundary of the PFSS solution 
domain follows previous work, where the field line intersecting the position 
of PSP is assumed to follow a Parker spiral with a curvature determined 
by the co-temporal solar wind velocity measurement at that position. 
As discussed in that work, although at lower radii this approximation is 
strongly perturbed by both corotational effects and the acceleration 
of the solar wind, these effects actually shift the coronal longitude by a 
similar magnitude but in opposite directions, resulting in an estimated 
error in longitude of less than 10°. This produces a very simple mapping 
(equation (2)) from spacecraft spherical Carrington coordinates (r_PSP, 
θ_PSP, φ_PSP) to coordinates on the source surface (r, θ, φ), involving the 
solar sidereal rotation rate, Ω_☉, the measured solar wind speed, v_R, and 
the source-surface height R_SS. 


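$$
\begin{pmatrix} r \\ \theta \\ \phi \end{pmatrix}
=
\begin{pmatrix} R_{SS} \\ \theta_{PSP} \\ \phi_{PSP} + \dfrac{\Omega_\odot}{v_R}\,(r_{PSP} - R_{SS}) \end{pmatrix}
\qquad (2)
$$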
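
The ballistic mapping of equation (2) can be sketched as a short function. This is an illustrative implementation only: the sidereal rotation period of 25.38 days and the solar radius of 695,700 km are assumed constants, and the example position and wind speed are hypothetical values, not measurements from the paper.

```python
import numpy as np

R_SUN_KM = 695_700.0                 # assumed solar radius, km
SIDEREAL_PERIOD_DAYS = 25.38         # assumed sidereal (Carrington) rotation period
OMEGA_SUN = 2.0 * np.pi / (SIDEREAL_PERIOD_DAYS * 86_400.0)  # rad/s


def ballistic_map(r_psp, lat_deg, lon_deg, v_r_kms, r_ss=2.0):
    """Map a spacecraft Carrington position to the source surface.

    r_psp and r_ss are in solar radii, angles in degrees, v_r_kms in km/s.
    Latitude is unchanged; longitude is advanced by the solar rotation during
    the radial travel time from r_ss to r_psp at constant speed.
    """
    travel_time = (np.asarray(r_psp, dtype=float) - r_ss) * R_SUN_KM / np.asarray(
        v_r_kms, dtype=float
    )
    lon_ss = (np.asarray(lon_deg, dtype=float) + np.degrees(OMEGA_SUN * travel_time)) % 360.0
    return r_ss, lat_deg, lon_ss


# Hypothetical sample: 35.7 solar radii, 330 degrees longitude, 350 km/s wind.
print(ballistic_map(35.7, -4.0, 330.0, 350.0, r_ss=2.0))
```

A faster wind shortens the travel time and therefore reduces the longitude shift, which is why the measured speed enters the mapping point by point.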
To generate time series predictions, we first download a magnetogram, 
choose a source-surface height and generate a PFSS solution using 
equation (1). We then take the trajectory of PSP and use equation (2) to 
produce a time series of latitudes and longitudes on the source surface 
to which PSP was connected (see red and blue trajectories in Fig. 1c, d, 
Extended Data Figs. 1-3). For each latitude and longitude we obtain a 
B_R value at the source surface from the PFSS model. Finally, we scale 
each B_R value by C(R_SS/r_PSP)² to produce an estimate of B_R at the location 
of PSP as a function of time. C is an empirically determined constant 
used to scale the time series prediction to match the peak measured 
magnetic field. Its value is dependent on the choice of magnetogram 
but approaches unity as the source-surface radius decreases and more 
flux is opened to the heliosphere. For the model results shown in Fig. 1, 
the values of C are 6.7 (2.0R☉ model) and 1.4 (1.2R☉ model). 
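
The final scaling step is simple enough to state as code. The sketch below applies the inverse-square factor and the empirical constant C; the source-surface field value and spacecraft distance used in the example are hypothetical and are not model output from the paper.

```python
def scale_to_spacecraft(b_r_source_surface, r_ss, r_psp, c=1.0):
    """Estimate B_R at the spacecraft from the source-surface value.

    The field is assumed to fall off as 1/r^2 beyond the source surface and is
    multiplied by the empirical constant c (6.7 and 1.4 are the values quoted
    for the 2.0 and 1.2 solar-radius models). Radii are in solar radii.
    """
    return c * b_r_source_surface * (r_ss / r_psp) ** 2


# Hypothetical source-surface field of -3,000 nT mapped out to 35.7 solar radii.
print(scale_to_spacecraft(-3000.0, r_ss=2.0, r_psp=35.7, c=6.7))
```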

To produce field line traces and generate Fig. 1c, d, we start with the 
time series of latitudes and longitudes on the source surface connected 
to PSP. For each pair of coordinates, we use pfsspy’s built-in field line 
tracer. Given the output of the pfsspy model, we supply the 
source-surface latitudes and longitudes and the field line tracer generates a 
field line which starts from that point and propagates it down to the 
photosphere. The model also provides a polarity for each field line 
generated, and we use these to colourize the field lines plotted in Fig. 1c, d. 


Choice of magnetogram data and source-surface radius for Fig. 1 
Synoptic maps of the photospheric magnetic field are available from 
multiple sources, which give variable outputs from the PFSS model. 
In this work we consider the NSO/GONG zero-point-corrected data 
product, SDO/HMI vector magnetogram data product, and the 
DeRosa (LMSAL) modelled magnetogram. GONG has the advantage 
of being operationally certified for space weather predictions, 
SDO/HMI is space-based and offers better resolution, and the DeRosa model 
assimilates HMI data and uses a surface flux transport and far-side 
helioseismological data to simulate photospheric dynamics such as 
differential rotation. 

Additional variation in the outputs of the PFSS model arises from the 
time evolution of photospheric observations. Synoptic magnetograms 
are built from many observations of the Sun from Earth as it rotates with 
an approximate 27-day period. Typically, only ±60° longitude about the 
central meridian (sub-Earth point) are used for each observation (the 
grey regions in Extended Data Figs. 1-3). Although these maps can be 
updated with new data as frequently as observations are made, parts 
of the Sun facing away from Earth cannot be updated until they rotate 
into view, meaning all synoptic maps consist of a mix of old and new 
data and evolve in time. 

Finally, the model output depends considerably on the choice of 
the source-surface radius parameter (R_SS). The inferred structure at 
the source surface changes as the source surface is lowered; implied 
structure such as the PIL (the contour of B_R = 0) becomes more 
structured and warped. The foot points of open field lines at the photosphere 
encompass larger areas, increasing the predicted size of coronal holes, 
and also increasing the total amount (both positive and negative) of 
magnetic flux crossing the source surface. 
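
The dependence of open flux on source-surface radius can be illustrated with the simplest analytic PFSS solution, an axisymmetric dipole. The sketch below does not use pfsspy or any magnetogram: it assumes a potential Φ_B ∝ (r⁻² − r/R_SS³) cos θ, which satisfies the Laplace equation and vanishes on the source surface so that the field there is purely radial. The normalization `b0` (polar field at the photosphere) and the function names are illustrative choices.

```python
import numpy as np


def pfss_dipole(r, theta, r_ss, b0=1.0):
    """(B_r, B_theta) of an axisymmetric dipole PFSS field.

    r and r_ss are in solar radii, theta is colatitude in radians, and b0 is
    the radial field at the photospheric pole (r = 1, theta = 0).
    """
    amp = b0 / (2.0 + r_ss ** -3)
    b_r = amp * (2.0 * r ** -3 + r_ss ** -3) * np.cos(theta)
    b_theta = amp * (r ** -3 - r_ss ** -3) * np.sin(theta)
    return b_r, b_theta


def open_flux(r_ss, b0=1.0):
    """Unsigned flux through the source surface, in units of b0 * R_sun^2."""
    b_pole, _ = pfss_dipole(r_ss, 0.0, r_ss, b0)
    # B_r on the source surface varies as cos(theta); the integral of
    # |cos(theta)| over the sphere is 2*pi.
    return 2.0 * np.pi * r_ss ** 2 * abs(b_pole)


for r_ss in (2.5, 2.0, 1.2):
    print(r_ss, open_flux(r_ss))
```

For a fixed photospheric field, the unsigned flux crossing the source surface grows as the source surface is lowered from 2.5 to 2.0 to 1.2 solar radii, in line with the behaviour described above.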

To draw robust conclusions, we generate model 
results for multiple times from all three magnetogram sources for 
varying source-surface radii. Colour maps of B_R at the source surface and 
the associated PILs are shown in Extended Data Figs. 1-3. The majority 
of models at 2.0R☉ and below predict polarity inversions in the vicinity 
of 240° and 310° longitude at all source-surface radii, with additional 
polarity inversions around 10° and 140° longitude that develop at lower 
source-surface radii. These features are all consistent with PSP 
measurements; we emphasize that they are largely independent of the time 
of observation and the choice of magnetogram source. Although the 
established value of 2.5R☉ still gives good results from a GONG 
evaluation, both the HMI and DeRosa models produce strong disagreement 
around the time of perihelion. In Fig. 1b-d we show results from the 
GONG zero-point-corrected map evaluated on 6 November 2018 UT, 
about which our time range of analysis is symmetric. This evaluation 
shows all the above features and produces good time series 
agreements. We show source-surface radii of 2.0R☉ and 1.2R☉. These lower 
source-surface radii do have modern precedent: 2.0R☉ is consistent 
with previous PFSS modelling done for the same interval, where that 
radius was chosen to better match the observed extent of coronal holes. 
Another previous work investigated the impact of lowering the 
source-surface radius on model results, observing that at solar minimum a 
lower (<2.0R☉) source-surface radius was required to populate 
equatorial coronal holes with open field lines and improve their estimates of 
magnetic field strength at 1 AU. 
Extended Data Fig. 1 | Variation of PFSS neutral line topology with time and 
magnetogram choice at R_SS = 2.5R☉. Colour maps of B_R at the source surface 
from PFSS extractions with source-surface radius R_SS = 2.5R☉. Red indicates 
positive polarity and blue indicates negative polarity. The black line shows the 
PIL (the contour of B_R = 0). Superposed is the ballistically projected PSP 
trajectory coloured by the measured polarity. Perihelion occurred around 330° 
longitude. Left to right, the columns show extractions from the NSO/GONG, 
SDO/HMI and DeRosa/LMSAL models. From top to bottom, the models are 
evaluated at a weekly cadence spanning six weeks about perihelion, with input 
magnetograms from each source taken as close in time as possible. The grey 
shading shows the region ±60° about the central meridian on the date of the 
model evaluation, indicating the portion of the Sun that could be observed at 
the time of observation. 
Extended Data Fig. 2 | Variation of PFSS neutral line topology with time and magnetogram choice at R_SS = 2.0R☉. Colour maps of B_R at the source surface from 
PFSS extractions with R_SS = 2.0R☉. Other features are as described in Extended Data Fig. 1. 
Extended Data Fig. 3 | Variation of PFSS neutral line topology with time and magnetogram choice at R_SS = 1.2R☉. Colour maps of B_R at the source surface from 
PFSS extractions with R_SS = 1.2R☉. Other features are as described in Extended Data Fig. 1. 
Extended Data Fig. 4 | Synoptic maps of extreme-ultraviolet coronal 
emission from Carrington rotation 2,210, assembled from the STEREO-A/EUVI 
and SDO/AIA instruments. Top, 171-Å data showing coronal Fe IX 
emission at around 600,000 K. This is the background of Fig. 1c, d. Bottom, 
193-Å (AIA) and 195-Å (EUVI) data showing emission from coronal Fe XII 
[…] the integrated plasma density squared along the line of sight. The dark regions 
in both images are probable locations of coronal holes, which are threaded by 
open magnetic field lines that allow plasma to evacuate into interplanetary 
space, resulting in under-dense regions. Carrington rotation 2,210 occurred 
from 20:51 26 October 2018 UT to 04:11 23 November 2018 UT. 
Extended Data Fig. 5 | During encounter 1, PSP connected magnetically to a 
small negative-polarity equatorial coronal hole. This schematic shows a 
potential field extrapolation of the solar magnetic field at the time of the first 
perihelion pass of PSP. The solar surface is shown, coloured by AIA 211-Å 
extreme-ultraviolet emission (see Extended Data Fig. 4 for other wavelengths). 
Coronal holes appear as a lighter shade. Superposed are various field lines 
initialized at the solar disk. Black lines indicate closed loops, blue and red 
illustrate open field lines with negative and positive polarities, respectively. As 
depicted here and in Fig. 1c, d, at perihelion PSP connected to a negative 
equatorial coronal hole. The ‘switchbacks’ (the jets) observed by PSP (Fig. 1a) 
are illustrated as kinks in the open field lines emerging from the coronal hole 
that connect to PSP. (Note that neither the radial distance to the spacecraft nor 
the scale or amplitude of the jets or switchbacks are to scale.) Spacecraft 
graphic is courtesy of NASA/Johns Hopkins APL. 
Heat transfer in solids is typically conducted through either electrons or atomic 
vibrations known as phonons. In a vacuum, heat has long been thought to be 
transferred by radiation but not by phonons because of the lack of a medium. 
Recent theory, however, has predicted that quantum fluctuations of electromagnetic 
fields could induce phonon coupling across a vacuum and thereby facilitate heat 
transfer. Revealing this unique quantum effect experimentally would bring 
fundamental insights to quantum thermodynamics and practical implications to 
thermal management in nanometre-scale technologies. Here we experimentally 
demonstrate heat transfer induced by quantum fluctuations between two objects 
separated by a vacuum gap. We use nanomechanical systems to realize strong phonon 
coupling through vacuum fluctuations, and observe the exchange of thermal energy 
between individual phonon modes. The experimental observation agrees well with 
our theoretical calculations and is unambiguously distinguished from other effects 
such as near-field radiation and electrostatic interaction. Our discovery of phonon 
transport through quantum fluctuations represents a previously unknown 
mechanism of heat transfer in addition to the conventional conduction, convection 
and radiation. It paves the way for the exploitation of quantum vacuum in energy 
transport at the nanoscale. 


Quantum mechanics states that quantum fields are never at rest but 
fluctuate constantly, even at a temperature of absolute zero. These 
fluctuations lead to extraordinary physical consequences in many areas, 
ranging from atomic physics (for example, spontaneous emission and 
the Lamb shift) to cosmology (for example, Hawking radiation). In 
1948, Casimir described a force that acts between neutral objects based 
on quantum fluctuations of electromagnetic fields. This force is of both 
fundamental interest in quantum field theory and practical importance 
in nanoscale and microscale technology. Although the mechanical 
consequences of the Casimir effect have been extensively studied and 
precisely quantified, its role in thermodynamics is rarely explored. 
Recently, it has been predicted that the Casimir effect can induce 
phonon transport between nearby objects and thus transfer heat through 
a vacuum gap. However, this intriguing quantum phenomenon has 
not been observed owing to stringent experimental requirements for 
nanometre gaps. At such small distances, other effects such as 
charge-charge interactions, evanescent electric fields and surface phonon 
polaritons may contribute and obscure experimental verification. 
Here we experimentally demonstrate heat transfer between two 
objects driven by quantum vacuum fluctuations. Using 
nanomechanical systems to access individual phonon modes and resonantly enhance 
the thermal energy exchange, we boost the distance range at which 
the phenomenon becomes observable by over two orders of 
magnitude to hundreds of nanometres, compared to the nanometre to 
sub-nanometre range predicted for bulk solids. This allows us to single 
out the Casimir effect from other short-range effects. We quantify 
the temperature change of the phonon modes through their thermal 
Brownian motion and unambiguously show that the two phonon modes 
thermalize in the strong Casimir phonon coupling regime. Our result 
reveals a new mechanism of heat transfer through a quantum vacuum. 
It also opens up new opportunities for studying quantum 
thermodynamics and energy transport using nanomechanical devices. 

To illustrate the concept, we consider the interaction of two phonon 
modes based on a spring-mass model (shown in Fig. 1a). Two objects 
attached to springs are linked to thermal baths at different 
temperatures and undergo thermal Brownian motions. Displacement of the two 
objects perturbs the zero-point energy of the electromagnetic vacuum, 
giving rise to the Casimir interaction. In the regime in which thermal 
Brownian motions of the objects are much slower than the response 
time of the Casimir interaction, the Casimir force acts instantaneously 
and is conservative in nature. The Casimir interaction effectively 
acts as a coupling spring that connects the two objects, through which 
the hot object agitates the cold object. As a result, thermal energy is 
transferred across the phonon modes from the hot to the cold side. 

In the experimental setting, we use frequency-matched 
nanomechanical oscillators to realize and resonantly enhance this Casimir 
heat transfer effect (Fig. 1b). Two parallel membrane resonators, 
each clamped to a substrate at different temperatures (T_1 and T_2), are 
separated by an adjustable distance, d. In the presence of the Casimir 
force, F_Cas(d), the system can be modelled as two coupled harmonic 
oscillators driven by Langevin forces from different temperature 
baths: 

$$\ddot{u}_i + 2\gamma_i \dot{u}_i + \Omega_i^2 u_i - 2\Omega g_c\,(u_i - \alpha_i u_j) = \delta F_i / m_i,$$

where i, j ∈ {1, 2}, i ≠ j, 
Fig. 1 | Casimir heat transfer driven by quantum vacuum fluctuations. a, As a 
conceptual illustration, we consider a spring-mass model in which two objects 
are separately linked to a hot and a cold thermal bath. The hot (or cold) object 
has higher (or lower) thermal energy and therefore undergoes greater (or 
lesser) thermal Brownian motion. Owing to the Casimir interaction, the two 
objects are effectively linked by a coupling spring through which the rapid 
thermal motion of the hot object agitates the cold object. As a result, thermal 
energy is transferred from the hot to the cold side. b, In the experimental 
setting, we use a pair of nanomechanical membrane resonators to demonstrate 
this mechanism of heat transfer. The two phonon modes (the fundamental 
modes of the membranes) have mode temperatures (T_i′) that are determined by 
their thermal Brownian motions. The Casimir interaction facilitates thermal 
energy exchange between the two phonon modes at short distances, d. As a 
result, the mode temperatures deviate from their bath temperatures (T_i′ ≠ T_i). 


g_c = F′_Cas(d)/2Ωρ_i is the coupling rate that arises from the Casimir force, 
and u_i, Ω_i, m_i, γ_i, α_i, ρ_i and δF_i are respectively the displacement, 
resonance frequency, effective mass, dissipation rate, mode-matching 
factor, membrane area density and Langevin force. At large separation, 
where the Casimir interaction is negligible, the phonon modes of the 
membranes are in thermal equilibrium with their respective thermal 
baths, that is, T_i′ = T_i, where k_B T_i′ = m_i Ω_i² ⟨u_i²⟩ defines the mode temperature 
determined by the thermal Brownian motion. At short distances, the 
Casimir interaction dominates and induces thermal energy exchange 
between the phonon modes, manifested as an observable deviation 
of the mode temperatures from their bath temperature (see 
Supplementary Information section 1). 
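
The definition of the mode temperature through the mean-square Brownian displacement can be illustrated with synthetic data. In the sketch below the effective mass of 10⁻¹⁰ kg is a hypothetical value chosen only for the example, and the displacement samples are drawn from a Gaussian distribution rather than measured.

```python
import numpy as np

K_B = 1.380649e-23  # Boltzmann constant, J/K


def mode_temperature(displacement, mass, omega):
    """Mode temperature from k_B T' = m * omega^2 * <u^2>.

    displacement is an array of displacements in metres, mass the effective
    mass in kg and omega the angular resonance frequency in rad/s.
    """
    displacement = np.asarray(displacement, dtype=float)
    return mass * omega ** 2 * np.mean(displacement ** 2) / K_B


# Synthetic thermal motion of a mode at 300 K (hypothetical effective mass).
mass = 1.0e-10                       # kg, assumed for illustration
omega = 2.0 * np.pi * 191.6e3        # rad/s, the matched resonance frequency
rms = np.sqrt(K_B * 300.0 / (mass * omega ** 2))
rng = np.random.default_rng(0)
u = rng.normal(0.0, rms, size=200_000)
print(rms, mode_temperature(u, mass, omega))
```

With these assumed parameters the root-mean-square displacement is a few picometres, and the recovered temperature matches the input to within the statistical scatter of the sample.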

We use optical interferometry to measure the thermal Brownian 
motion in order to determine the phonon mode temperatures (Fig. 2a). 
Using minimal laser power (8 pW) to avoid thermo-optical heating, we 
resolve the thermomechanical noise of the fundamental modes with 
a signal-to-background ratio of about 20 dB (Fig. 2e, f). The two 
high-stress stoichiometric Si₃N₄ membranes of different dimensions 
(330 × 330 × 0.1 µm³ and 280 × 280 × 0.1 µm³) are coated with gold 
(75 nm) on both sides for the purposes of optical reflection and 
electrical contact (Fig. 2b, c). The dimensions of the two membranes 
are different such that their fundamental flexural mode frequencies 
can be matched at different temperatures by thermally tuning the 
membrane stress. At bath temperatures T_1 = 287.0 K and T_2 = 312.5 K, 
the resonances match at Ω/2π = 191.6 kHz (Fig. 2d), with high quality 
factors of Q_1 = 4.5 × 10⁴ and Q_2 = 2.0 × 10⁴. A bias voltage (V_b) is applied 
across the two membranes to compensate for any built-in electrostatic 
potential that may overwhelm the Casimir effect. 

An essential experimental requirement here is to align the two planar 
resonators with a high degree of parallelism, which has been a hurdle 
for precision measurement of the Casimir force between planar 
structures. To solve the problem, we implement high-precision (below 
10⁻⁴ rad) membrane alignment using an optical interferometric 
technique and an electrical method with specific mesa structures and 
electrode patterning (Fig. 2b, c) (see Methods). This allows us to explore 
the Casimir interaction between two parallel planes separated by an 
unprecedented distance of around 300 nm. 

We observe the Casimir heat transfer between the phonon modes 
of the membranes (Fig. 3a). The mode temperatures show a strong 
dependence on the distance. At large separations, the mode 
temperatures are the same as their thermal bath temperatures, while at small 
separations (less than 600 nm) they begin to deviate. As the distance 
is decreased further to below 400 nm, T_1′ and T_2′ become nearly 
identical, showing thermalization of the two phonon modes. Such a 
heat-transfer effect is observed only when the resonance frequencies are 
matched within the linewidth, that is, when |Ω_1 − Ω_2| is less than γ_1, γ_2. In 
the measurement, the mode temperatures are determined by their 
thermal Brownian motions. The mechanical motion can be 
decomposed into u_i(t) = X_i(t)cosΩt + Y_i(t)sinΩt, with X_i(t) and Y_i(t) being the 
two quadrature components. The two measured quadrature 
components display a circularly symmetric distribution in the phase space, 
showing that the thermal motions are random with all phases being 
equally available (Fig. 3b, c). A plot of the probability distribution of 
the total energy E_i = m_iΩ²(X_i² + Y_i²)/2 (Fig. 3d) shows that it follows the 
statistics of a canonical ensemble, that is, P(E_i) ∝ exp(−E_i/k_BT_i′). The difference 
between T_i′ and T_i determines the net heat flux flowing from the thermal 
bath to the phonon mode, given by P_i = 2γ_ik_B(T_i − T_i′). From the measured 
mode temperatures (Fig. 3a), we obtain the averaged heat flux 
transferred across the two thermal baths by P_1→2 = (P_1 − P_2)/2 (Fig. 3e). 

The observed phenomenon can be quantitatively explained by the 
competition between the Casimir coupling rate (g_c) and the mode-bath 
thermal exchange rate (γ_i = Ω/2Q_i). When d decreases from 600 nm to 
350 nm, g_c increases rapidly and the system evolves from the weak (g_c ≪ γ_i) 
to the strong (g_c ≫ γ_i) Casimir phonon coupling regime (Extended Data 
Fig. 1). Using coupled-mode Langevin equations (see Supplementary 
Information, section 1), we derive the mode temperatures and the heat 
flux across the two thermal baths as: 


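$$T_i' = T_i - \frac{\gamma_j\,(T_i - T_j)}{(\gamma_i + \gamma_j)\left(1 + \dfrac{\gamma_i \gamma_j}{g_c'^{2}}\right)}, \qquad P_{1\to 2} = \frac{2\gamma_1 \gamma_2 k_B\,(T_1 - T_2)}{(\gamma_1 + \gamma_2)\left(1 + \dfrac{\gamma_1 \gamma_2}{g_c'^{2}}\right)}$$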

where g_c′ = g_c√(α_1α_2) is the effective coupling rate that accounts for mode 
mismatch. In the experiment, √(α_1α_2) = 0.97 and therefore g_c′ is 
approximately g_c. The theoretical prediction describes the experimental 
data well (solid lines in Fig. 3a, e). When g_c ≫ γ_i, thermalization occurs 
(T_1′ = T_2′) and the heat flux reaches a maximum value of 6.5 × 10⁻²¹ J s⁻¹. 
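
The steady-state expressions can be checked numerically. The sketch below evaluates the mode temperatures and the heat flux for a given effective coupling rate, assuming dissipation rates γ_i = Ω/2Q_i with quality factors of 4.5 × 10⁴ and 2.0 × 10⁴, the stated bath temperatures and the matched frequency of 191.6 kHz. The function name is illustrative, and the formulas are the rotating-wave steady state of two linearly coupled damped modes rather than code from the study.

```python
import numpy as np

K_B = 1.380649e-23  # Boltzmann constant, J/K


def coupled_mode_steady_state(t1, t2, gamma1, gamma2, g_eff):
    """Mode temperatures (T1', T2') and heat flux from bath 1 to bath 2.

    t1 and t2 are bath temperatures in K; gamma1, gamma2 and g_eff (> 0) are
    the dissipation and effective coupling rates in s^-1. The flux is in J/s
    and is negative when heat flows from bath 2 to bath 1.
    """
    denom = (gamma1 + gamma2) * (1.0 + gamma1 * gamma2 / g_eff ** 2)
    t1_mode = t1 - gamma2 * (t1 - t2) / denom
    t2_mode = t2 - gamma1 * (t2 - t1) / denom
    flux_1_to_2 = 2.0 * gamma1 * gamma2 * K_B * (t1 - t2) / denom
    return t1_mode, t2_mode, flux_1_to_2


omega = 2.0 * np.pi * 191.6e3          # rad/s
gamma1 = omega / (2.0 * 4.5e4)         # s^-1, assuming Q_1 = 4.5e4
gamma2 = omega / (2.0 * 2.0e4)         # s^-1, assuming Q_2 = 2.0e4

for g_eff in (0.01 * gamma1, gamma1, 1000.0 * gamma2):
    print(g_eff, coupled_mode_steady_state(287.0, 312.5, gamma1, gamma2, g_eff))
```

In the weak-coupling limit the mode temperatures stay at their bath values, and in the strong-coupling limit they converge to a common value while the magnitude of the flux saturates at 2γ_1γ_2k_B|T_1 − T_2|/(γ_1 + γ_2), which for these parameters is about 6.5 × 10⁻²¹ J s⁻¹.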
Additional experimental results obtained from different samples and 
conditions are presented in Extended Data Fig. 2. In our theoretical 
analysis, we apply the proximity force approximation, which is valid 
under the condition that the wavelength of the phonon mode is much 
Fig. 2 | Experimental setup and fabricated samples. a, Cross-sectional view of
the experimental setup, showing two nanomechanical Si$_3$N$_4$ membranes
aligned in parallel and brought close together. Partially reflecting mirrors (M$_1$
and M$_2$) are placed behind the membranes, and laser beams (wavelength
633 nm) are sent from both sides to interferometrically measure the
thermomechanical motion of the membranes. The distances between the
mirrors and the membranes are feedback controlled by piezo-actuators to
maintain long-term stability of the interferometric detection sensitivity. The
two samples are mounted on a closed-loop thermoelectric cooler and heater to
stabilize the sample temperatures and tune the mechanical resonance
frequencies. The setup is kept in a vacuum below $10^{-6}$ Torr. b, c, Optical images
of the samples. The Si$_3$N$_4$ membranes (100 nm) are coated with gold (75 nm) on
both sides for optical reflection and electrical contact. The mesa structure on
the left sample (b) and the electrodes on the right sample (c) are fabricated for
parallel alignment (see Methods). To compensate for the built-in electrostatic
potential, a bias voltage $V_b$ is applied between the membrane surfaces. d, At
room temperature, resonance frequencies of the two modes differ by around
50 kHz owing to the difference in the dimensions of the membranes. By
adjusting the sample temperature to tune the film stress, the two frequencies
are matched at $\Omega/2\pi = 191.6$ kHz when bath temperatures reach $T_1 = 287.0$ K and
$T_2 = 312.5$ K (black dashed line). e, f, Thermomechanical noise spectra of the
fundamental modes.
Fig. 3 | Observation of Casimir heat transfer. a, In the presence of the Casimir
interaction, the mode temperatures deviate from their bath temperatures
when the two membranes are brought close. At distances below 400 nm, $T_1'$ and
$T_2'$ become nearly identical, showing thermalization of the two phonon modes.
The mode temperatures are measured from the thermal Brownian motion,
$k_\mathrm{B}T_i' = m_i\Omega^2\langle u_i^2\rangle$. Error bars represent the standard error obtained from four
hours of continuous measurement. The data agree well with calculations using
coupled-mode Langevin equations (solid lines). b, c, Measured quadrature
components (in picometres) of the thermal displacement of phonon modes 1
(b) and 2 (c) at $T_1 = 287.0$ K and $T_2 = 312.5$ K, respectively. Dashed lines indicate
standard deviations of the distributions; the enclosed areas are proportional to
the mode temperatures. d, Probability distributions of the phonon-mode
energy follow the statistics of a canonical ensemble, $P(E_i) \propto e^{-E_i/k_\mathrm{B}T_i'}$,
represented by solid lines. e, Heat flux transferred across the two thermal
baths as a function of distance, extracted from the measured mode
temperatures in panel a. The error bars originate from error propagation in the
calculation. The solid line represents the theoretical prediction of the coupled-
mode model. Additional experimental results obtained from different samples
and conditions are presented in Extended Data Fig. 2.
Fig. 4 | Strong phonon coupling through Casimir interaction.
a, Thermomechanical noise spectrum of membrane 2 at $d = 400$ nm. The two
distinct branches result from strong coupling of the phonon modes. The upper
(or lower) branch represents the even (or odd) mode where the two membranes
move in the same (or opposite) direction. b, Frequency splitting of the
thermomechanical noise spectrum shows a parabolic dependence on the bias
voltage between the membranes (solid curves are parabolic fits). The dashed
line represents the sum of the phonon mode linewidths, above which the
frequency splitting can be well resolved in the spectra. The curvatures of the
parabolas are proportional to the electrostatic interaction strength $K_e(d)$,


larger than the membrane separation ($d/\lambda$ is of the order of $10^{-3}$ in the
experiment). For higher-order phonon modes with wavelengths comparable
to or smaller than the gap, modification of Casimir energy
owing to the surface modulation of phonons needs to be considered.
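The coupled-mode expressions for the mode temperatures and the heat flux can be evaluated numerically. The sketch below assumes the form $T_i' = T_i + \gamma_j(T_j - T_i)/[(\gamma_i+\gamma_j)(1+\gamma_i\gamma_j/\tilde{g}_c^2)]$ and the matching expression for $P_{1\to2}$; it further assumes that the mechanical linewidths quoted in the Methods (4.6 Hz and 9.6 Hz) are full widths, so that $\gamma_i = \Omega/2Q_i = \pi \times$ linewidth. Function names and the sampled coupling rates are hypothetical.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant (J/K)


def _denominator(gamma_1, gamma_2, g_eff):
    """Shared factor (g1 + g2) * (1 + g1 * g2 / g_eff^2); requires g_eff > 0."""
    return (gamma_1 + gamma_2) * (1.0 + gamma_1 * gamma_2 / g_eff**2)


def mode_temperatures(t_1, t_2, gamma_1, gamma_2, g_eff):
    """Steady-state mode temperatures (T1', T2') for bath temperatures T1, T2."""
    denom = _denominator(gamma_1, gamma_2, g_eff)
    t_1_mode = t_1 + gamma_2 * (t_2 - t_1) / denom
    t_2_mode = t_2 + gamma_1 * (t_1 - t_2) / denom
    return t_1_mode, t_2_mode


def heat_flux_1_to_2(t_1, t_2, gamma_1, gamma_2, g_eff):
    """Heat flux from bath 1 to bath 2 (negative when bath 2 is hotter)."""
    denom = _denominator(gamma_1, gamma_2, g_eff)
    return 2.0 * gamma_1 * gamma_2 * K_B * (t_1 - t_2) / denom


T_1, T_2 = 287.0, 312.5      # bath temperatures from the text (K)
GAMMA_1 = math.pi * 4.6      # assumed: gamma = pi * full linewidth (1/s)
GAMMA_2 = math.pi * 9.6

# Sweep the coupling from weak to strong (values chosen for illustration).
for g_eff in (0.1 * GAMMA_1, GAMMA_1, 10.0 * GAMMA_2, 1e3 * GAMMA_2):
    t1m, t2m = mode_temperatures(T_1, T_2, GAMMA_1, GAMMA_2, g_eff)
    flux = heat_flux_1_to_2(T_1, T_2, GAMMA_1, GAMMA_2, g_eff)
    print(f"g = {g_eff:10.2f} 1/s  T1' = {t1m:7.2f} K  "
          f"T2' = {t2m:7.2f} K  |P| = {abs(flux):.2e} J/s")
```

Under these assumptions the strong-coupling limit gives a heat-flux magnitude just below $7\times10^{-21}$ J s$^{-1}$, the same order as the maximum value reported in the text.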

To verify that the observed heat transfer is due to the Casimir interaction,
we examine the phonon-mode coupling characteristics. When
the membranes are brought close, we observe an anti-crossing feature
in the thermal noise spectra, revealing strong coupling of the two
modes (Fig. 4a). The frequency splitting $\Delta f$ of the two peaks is a direct
indicator of the coupling strength. Taking both the Casimir and
the electrostatic effects into consideration, we can express the
frequency splitting as $\Delta f = \Delta f_\mathrm{Cas} + \Delta f_\mathrm{ele}$, with $\Delta f_\mathrm{Cas} = g_c(d)/\pi$ and
$\Delta f_\mathrm{ele} = K_e(d)[(V_b - V_0)^2 + V_\mathrm{rms}^2]$ being the Casimir and electrostatic
components, respectively. The coefficient $K_e(d)$ is proportional to $d^{-3}$, and
$V_0$ and $V_\mathrm{rms}$ represent the first and second moments of the surface
potential difference between the two metallized membranes. The
unique dependence of $\Delta f_\mathrm{ele}$ on $V_b$ and $d$ (Fig. 4b) allows calibration of
the absolute distance between the two membranes (see Methods).
We observe that the surface potential $V_0$ remains constant as distance
is varied (see Extended Data Fig. 3), which agrees with the theoretical
prediction for a parallel-plane configuration. When $V_0$ is compensated
by the applied bias voltage $V_b$, we observe that the frequency splitting
shows a distance dependence of $d^{-4.91\pm0.12}$ (Fig. 4c). This verifies that
the Casimir effect dominates over the electrostatic effect in our measurement.
This result also represents the first demonstration of strong
phonon coupling induced by the Casimir force.
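A power-law exponent of this kind can be extracted with a straight-line fit in log-log space. The sketch below is one simple way to do so, not necessarily the procedure used by the authors; the function name is hypothetical, and the data are synthetic, generated from an assumed ideal $d^{-5}$ law with an arbitrary amplitude purely to exercise the fit.

```python
import numpy as np


def fit_power_law(distance, splitting):
    """Fit splitting = amplitude * distance**exponent by linear regression
    of log(splitting) against log(distance)."""
    slope, intercept = np.polyfit(np.log(distance), np.log(splitting), 1)
    return slope, np.exp(intercept)


# Synthetic data (illustrative only): distances in nm, splitting in Hz.
d_nm = np.linspace(350.0, 600.0, 11)
AMPLITUDE = 20.0 * 400.0**5          # chosen so the splitting is 20 Hz at 400 nm
delta_f = AMPLITUDE * d_nm**-5.0

exponent, amplitude = fit_power_law(d_nm, delta_f)
print(f"fitted exponent: {exponent:.3f}")
```

With real data the fitted exponent carries an uncertainty, which can be estimated, for example, from the covariance returned by `np.polyfit(..., cov=True)`.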

Finally, we distinguish the observed Casimir heat transfer from ther- 
mal radiation effects. Near-field thermal radiation generates heat flow 
through the vacuum gap and slightly modifies the temperature of the 
bulk membranes. This leads to changes in membrane stress and thus 
frequency shifts of the phonon modes (see Methods and Extended 
Data Fig. 4). Unlike Casimir phonon coupling, the thermal radiation 
effect does not depend on the frequency matching of the two modes, 
which we verify by offsetting the frequencies of the two modes through 
thermal tuning. The observed frequency shifts are less than 40 Hz in the
distance range of our measurement. These frequency shifts correspond
to temperature changes of less than 0.02 K, based on the measured
frequency-temperature dependence of 2 kHz K$^{-1}$ for the membrane
modes (Fig. 2d). The slight temperature changes agree with our calculation
using the measured radiation heat transfer coefficient between
gold surfaces. On the other hand, thermal radiation pressure may also
which has a distance dependence of $d^{-3}$ according to Coulomb's law (shown in
panel c, inset). This dependence is used to calibrate the absolute distance
between the membranes (see Methods). c, The minimum frequency
splitting for each distance shows a distance dependence of $d^{-4.91\pm0.12}$. This
power law verifies that the Casimir interaction is dominant over the
electrostatic interaction in our measurement. The inset plots $K_e$, which has a $d^{-3}$
dependence. In panels b, c, the frequency splitting is determined by fitting the
two peak positions in the measured spectra, which gives a precision of roughly
1 Hz (smaller than the data markers).


provide mechanical coupling between two phonon modes. However,
such an effect is estimated to be negligible under our experimental
conditions (less than 4% for distances shorter than 800 nm) (see Extended
Data Fig. 1e).

In conclusion, we have experimentally demonstrated heat trans- 
fer driven by quantum fluctuations using nanomechanical devices. 
Our observation is unambiguously distinguished from other effects, 
including electrostatic interactions and near-field thermal radiation. 
In this work, we have focused on heat transfer through single-phonon
modes. When the majority of phonon modes in a solid take part in
the thermal exchange process, the effect generalizes to heat transfer
between two bulk solids. The ability to control thermal flow with a
quantum vacuum opens up a new arena for studying quantum thermodynamics
and for implementing quantum thermal machines.
Moreover, our method for achieving and controlling strong Casimir 
phonon coupling provides a versatile platform for implementing 
coherent phonon processes (for example, phonon state transfer and 
entanglement) using a quantum vacuum. 
Methods


Numerical calculation of the Casimir force 

We calculate the Casimir force between the membrane samples on the
basis of Lifshitz theory. The calculation takes into account the finite
conductivity and dispersion of the gold film and the geometry of the
fabricated membrane structure (Extended Data Fig. 1a). The Casimir
force per area can be written as $F_\mathrm{Cas}(d) = -\eta\hbar c\pi^2/240d^4$, where $\eta$ is the
correction factor applied to the Casimir force between two planar
perfect conductors (Extended Data Fig. 1b). When the system enters
the strong coupling regime ($g_c \gg \gamma_1, \gamma_2$), thermalization between the
two phonon modes ($T_1' = T_2'$) occurs (see Extended Data Fig. 1c, d). We
also calculate the Casimir pressure caused by thermal fluctuations of
the electromagnetic field (Extended Data Fig. 1e). In the distance
range of our experiment ($d < 800$ nm), the thermal Casimir force (at
300 K) is less than 4% of the Casimir pressure driven by quantum
fluctuations.
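For orientation, the expression for the Casimir force per area can be evaluated directly. The sketch below treats the correction factor $\eta$ as a plain input with a default of 1 (the perfect-conductor limit); it does not implement the Lifshitz calculation, and the function name and sampled distances are assumptions made here.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant (J s)
C = 299_792_458.0       # speed of light (m/s)


def casimir_pressure(d, eta=1.0):
    """Casimir force per area, F = -eta * hbar * c * pi^2 / (240 * d^4).

    d is the separation in metres; eta is the correction factor relative
    to two planar perfect conductors (eta = 1 is the ideal limit).
    The negative sign denotes attraction.
    """
    return -eta * HBAR * C * math.pi**2 / (240.0 * d**4)


# Ideal-conductor pressures at a few separations in the experimental range.
for d_nm in (350, 400, 600, 800):
    p = casimir_pressure(d_nm * 1e-9)
    print(f"d = {d_nm} nm: F/A = {p:.3e} Pa")
```

Because the pressure scales as $d^{-4}$, halving the separation increases its magnitude sixteenfold.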


Device fabrication 

The device fabrication process is illustrated in Extended Data Fig. 5a-f.
The process started with a silicon wafer (500 µm thick) coated with stoichiometric
Si$_3$N$_4$ (100 nm thick) on both sides by low-pressure chemical
vapour deposition. Photolithography was performed at the back side
of the wafer and SF$_6$ plasma etching was used to remove the Si$_3$N$_4$ at the
opening windows. The silicon wafer was then etched through in a KOH
solution (25%, 80 °C, 7 h), creating freestanding Si$_3$N$_4$ membranes at the
front side of the wafer. For the right sample, the first photolithography
and liftoff were performed to pattern contact electrodes (75 nm Au) on
the surface of the membrane. The second photolithography and liftoff
were performed to pattern spacers (150 nm Au). After that, 75 nm Au was
evaporated onto the back side of the sample. For the left sample,
photolithography and SF$_6$ plasma etching were performed to define a
square region (500 × 500 µm$^2$) around the membrane, whose corners
were designed to make contact with the spacers on the right sample.
A short KOH etch (25%, 80 °C, 15 min) was used to create a mesa structure
with a depth of around 25 µm. After the KOH etch, 75 nm Au was
evaporated onto both sides of the sample.

After fabrication, the left sample was attached to a custom-made
copper plate using conductive silver paint (Extended Data Fig. 5i) and
the right sample was mounted on a printed circuit board (PCB) with
the on-chip electrodes wire-bonded to the corresponding contact pads
(Extended Data Fig. 5j).

Cleanliness of the membrane surfaces is crucial for parallel align- 
ment of the two membranes at a short distance. We inspected samples 
under a confocal microscope, which can identify particles with sizes 
down to 100 nm. To maintain a high degree of cleanliness, we carried 
out the sample fabrication, wire-bonding and mounting of samples 
onto sample holders in a clean-room environment. Using atomic force
microscopy, we measured the surface roughness of the membrane to 
be less than 1.5 nm. 


Parallel alignment of the membrane samples 

A diagram of the parallel alignment setup is shown in Extended Data
Fig. 6a. The two samples were designed such that the corners of the
mesa structure on the left sample align to the spacers on the right sample.
The gold films coated on both sides of the membranes block the
red detection laser ($\lambda = 633$ nm) while allowing dim transmission of
blue illumination from a high-brightness LED ($\lambda = 460$ nm). A bias voltage,
$V_b$, is applied between the two membrane surfaces. When the two
samples are brought close and touch each other at the spacer regions,
an electrical signal is picked up, pinpointing the corner that is touching.
Simultaneous touching of the four corners indicates good parallel
alignment. In the experiment, the distances at which the first corner
touches and at which all four corners touch are within 80 nm.


Simultaneously, the parallelism between the membranes is monitored
optically by imaging the brightness distribution of the interference
pattern. Aligned membranes show uniform optical images
when changing the distance (Extended Data Fig. 6b). By analysing the
optical intensity at different locations of the membrane while changing
the membrane separation, we obtain the relative tilting angles of the
membranes with respect to the x and y axes as $\Delta\theta_x = 22 \pm 25$ µrad and
$\Delta\theta_y = 43 \pm 24$ µrad (Extended Data Fig. 6d). For comparison, images
of the misaligned membranes are shown in Extended Data Fig. 6c.
In this case, $\Delta\theta_x = 228 \pm 33$ µrad and $\Delta\theta_y = 179 \pm 39$ µrad (Extended Data
Fig. 6e).


Experimental setup and signal acquisition 

The experimental setup is illustrated in Extended Data Fig. 7a. Two
laser beams split from an intensity-stabilized He-Ne laser (Thorlabs,
HRS015B) are sent to the back sides of the membranes to interferometrically
detect the thermal motion of the membranes. The distance
between the mirror and the membrane is feedback-controlled by piezo-actuators,
using the DC component of the reflected light (monitored
by photodetectors) as a feedback signal. This is to achieve long-term
stability of the detection sensitivity. The samples are mounted on a
closed-loop thermoelectric heater/cooler, using a platinum resistance-temperature
detector (RTD) as a temperature sensor connected to
a temperature controller (Lakeshore 330). Feedback control of the
bath temperatures allows stabilization and tuning of the mechanical
frequencies. The right sample is mounted on a calibrated closed-loop
piezo linear stage, which controls the relative distance between the
membranes with a precision of around 5 nm.

The AC optical signals are detected by avalanche photodetectors
and fed to lock-in amplifiers (Stanford Research, SR840). The lock-in
reference frequency is offset by 50 Hz from the mechanical
resonance in order to prevent spiking of the reference signal. The
time sequences of the two quadrature components are recorded by 
a data-acquisition system. Fast Fourier transform is then performed 
to obtain the thermal spectra of the mechanical modes. This method 
allows faster measurement of the thermal spectra compared with the 
frequency-sweeping benchtop spectrum analyser. 
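The step from recorded quadratures to a spectrum can be sketched as follows. This is a minimal illustration rather than the authors' acquisition code: the sample rate, record length, window choice and the sign convention for combining the two quadratures into a complex signal (`x - 1j * y`, so that a signal above the reference appears at a positive offset) are all assumptions introduced here and depend on the lock-in configuration.

```python
import numpy as np


def thermal_spectrum(x, y, sample_rate, f_ref):
    """Power spectrum of demodulated quadratures, on an absolute frequency axis.

    x, y        : quadrature time sequences from the lock-in amplifier
    sample_rate : sampling rate of the recorded sequences (Hz)
    f_ref       : lock-in reference frequency (Hz)
    """
    z = np.asarray(x, dtype=float) - 1j * np.asarray(y, dtype=float)
    n = z.size
    window = np.hanning(n)                      # reduces spectral leakage
    spectrum = np.fft.fftshift(np.fft.fft(z * window))
    psd = np.abs(spectrum) ** 2 / (sample_rate * np.sum(window**2))
    offsets = np.fft.fftshift(np.fft.fftfreq(n, d=1.0 / sample_rate))
    return f_ref + offsets, psd


# Synthetic check: a tone 50 Hz above the reference plus weak noise.
rng = np.random.default_rng(1)
FS = 1000.0                       # hypothetical sample rate (Hz)
F_MECH = 191_600.0                # resonance frequency quoted in the text (Hz)
F_REF = F_MECH - 50.0             # reference offset by 50 Hz, as in the text
t = np.arange(10_000) / FS
x = np.cos(2 * np.pi * 50.0 * t) + 0.1 * rng.normal(size=t.size)
y = -np.sin(2 * np.pi * 50.0 * t) + 0.1 * rng.normal(size=t.size)

freqs, psd = thermal_spectrum(x, y, FS, F_REF)
peak = freqs[np.argmax(psd)]
print(f"peak at {peak:.1f} Hz")
```

Averaging the spectra of many such records would reduce the scatter of a thermal-noise spectrum; a single record is used here only to keep the example short.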

An optical image of the sample mount assembly and control stages
is shown in Extended Data Fig. 7b. During the measurement, the whole
assembly is kept in a vacuum chamber with pressure below $10^{-6}$ Torr. To
avoid optical heating, laser powers are kept low at 8 µW before entering
the chamber.


Electrostatic calibration of absolute distance between 
membranes 

The relative distance between the membranes, $d'$, is controlled by a
closed-loop piezo linear stage using a strain-gauge sensor. To calibrate
the absolute distance between the membranes, $d = d' + d_0$, a widely
used method in Casimir force measurements is to make use of the
unique distance and voltage dependence of the electrostatic effect.
In our experiment, the frequency splitting of the mechanical resonance
due to electrostatic potentials follows $\Delta f_\mathrm{ele} = K_e(d)[(V_b - V_0)^2 + V_\mathrm{rms}^2]$,
with electrostatic strength $K_e(d) \propto d^{-3}$. At each distance, the measured
frequency splitting shows a parabolic dependence on the bias voltage
(Fig. 4b). We fit the parabola curvatures (electrostatic strength $K_e(d)$)
with a power law of $(d' + d_0)^{-3}$, using $d_0$ as the fitting parameter. Using
this method, we determine the absolute distance between the membranes
with a precision of around 5 nm (Extended Data Fig. 3a). From
the fitting, we also obtain the surface potential, $V_0$, at each distance
(Extended Data Fig. 3b). The surface potential remains constant at
different distances, agreeing with the theoretical prediction for a
parallel-planes configuration. (A distance dependence in $V_0$ is
expected for the sphere-plane configuration in other Casimir force
experiments.)
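One simple way to carry out such a calibration fit, sketched here under stated assumptions, uses the fact that if $K_e \propto (d' + d_0)^{-3}$ then $K_e^{-1/3}$ is linear in $d'$, so $d_0$ follows from a straight-line fit. This linearization is chosen here for brevity and is not claimed to be the authors' fitting procedure; the function name, amplitude and synthetic data are hypothetical.

```python
import numpy as np


def fit_distance_offset(d_rel, k_e):
    """Estimate d_0 from K_e = amplitude * (d_rel + d_0)**-3.

    K_e**(-1/3) = amplitude**(-1/3) * (d_rel + d_0) is linear in d_rel,
    so d_0 = intercept / slope and amplitude = slope**-3.
    """
    y = np.asarray(k_e, dtype=float) ** (-1.0 / 3.0)
    slope, intercept = np.polyfit(np.asarray(d_rel, dtype=float), y, 1)
    return intercept / slope, slope**-3.0


# Synthetic data (illustrative only): distances in nm, K_e in Hz per V^2.
D0_TRUE = 120.0                               # hypothetical offset (nm)
AMP_TRUE = 1.0e9                              # hypothetical amplitude (Hz V^-2 nm^3)
d_rel_nm = np.linspace(200.0, 500.0, 7)       # stage readings (nm)
k_e = AMP_TRUE * (d_rel_nm + D0_TRUE) ** -3.0

d0_fit, amp_fit = fit_distance_offset(d_rel_nm, k_e)
print(f"fitted offset d_0 = {d0_fit:.2f} nm")
```

With noisy data a direct nonlinear least-squares fit of the power law may weight the points more appropriately than this linearized form.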


Thermal feedback control of mechanical frequencies 

Without feedback control, the mechanical frequencies typically drift
at a rate of around 0.3 Hz min$^{-1}$. During heat-transfer measurement, we
apply feedback to control the bath temperatures and lock the mechani- 
cal resonance to a certain frequency. The frequency mismatch of the 
two modes can be maintained below 2 Hz, which is well below the 
linewidths of the two mechanical modes (4.6 Hz and 9.6 Hz) (Extended 
Data Fig. 8a). The time scale of the feedback loop is 16 s; the frequency 
is measured over 16 s and the heater/cooler power is adjusted at the 
end of this period. 

Throughout the measurement, the sample bath temperatures are
monitored by RTD sensors. Bath temperature fluctuations are unresolvable
with the 0.01 K sensitivity of our temperature control system
(Extended Data Fig. 8b). The bath temperature behaves similarly at all
distances. This gives an upper bound for the temperature fluctuations
of $\delta T_\mathrm{max} = 0.01$ K. To get a better estimation, we calculate $\delta T$ on the basis
of the fluctuations in the mechanical frequencies ($\delta f = 2$ Hz; Extended
Data Fig. 8a) and the measured frequency-temperature coefficient
$\mathrm{d}f/\mathrm{d}T$ (2 kHz K$^{-1}$) of the membrane modes (Fig. 2d). The estimated
fluctuation of the bath temperature $\delta T$ is approximately 0.001 K.
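The estimate above is a single division, $\delta T = \delta f / (\mathrm{d}f/\mathrm{d}T)$. The short sketch below spells it out with the values quoted in the text (2 Hz and 2 kHz K$^{-1}$) and applies the same conversion to the 40 Hz frequency-shift bound discussed in connection with thermal radiation; the function name is introduced here for illustration.

```python
def temperature_change(freq_shift_hz, coeff_hz_per_k=2000.0):
    """Convert a frequency shift (Hz) into a temperature change (K)
    using the frequency-temperature coefficient df/dT (Hz per K)."""
    return freq_shift_hz / coeff_hz_per_k


bath_fluctuation = temperature_change(2.0)    # from the 2 Hz frequency fluctuation
radiation_bound = temperature_change(40.0)    # from the 40 Hz frequency-shift bound
print(f"bath fluctuation estimate: {bath_fluctuation} K")
print(f"radiation-induced bound:   {radiation_bound} K")
```

These evaluate to 0.001 K and 0.02 K, respectively, matching the figures given in the text.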


Near-field thermal radiation effects 

When the two membranes are brought close together, near-field thermal
radiation could induce a deviation of the local temperatures on the
sample surface from the temperature of the sample holder. This would
lead to a difference between the actual and measured bath temperatures.
At separations greater than 300 nm, the radiation heat transfer
coefficient between gold surfaces has been measured to be less than
1.4 W m$^{-2}$ K$^{-1}$. Using the thermal conductivities of gold (150 W m$^{-1}$ K$^{-1}$ at
75 nm) and silicon nitride (10 W m$^{-1}$ K$^{-1}$) thin films, we calculate the
local temperature deviation to be less than 0.02 K.

The local temperature change due to thermal radiation modifies the
membrane stress and therefore induces a frequency downshift (or
upshift) of mechanical mode 1 (or 2) when the membranes are close.
This thermal radiation effect does not depend on the frequency matching
of the two modes. To observe this effect, we first offset the frequencies
of the two modes by 250 Hz by thermal tuning, and then fix the
output of the heater and cooler (with feedback turned off). We note
that the observed frequency shifts (Extended Data Fig. 4a) also include
a contribution from the Casimir force (see Supplementary Information
section 1). Such a shift is equal to half of the frequency splitting in the
Casimir strong coupling regime (see Supplementary Information equation
(S13)). We use the measured frequency splitting (Fig. 4c) to calculate
the frequency shift caused by the Casimir force. For distances
outside of the measured range, $g_c$ is extrapolated using the power law
$g_c \propto d^{-4.91}$. The corrected frequency shifts are less than 40 Hz when $d$
is greater than 300 nm (Extended Data Fig. 4b). On the basis of the
measured frequency-temperature coefficient of 2 kHz K$^{-1}$ for the
membrane modes (Fig. 2d), these frequency shifts correspond to
temperature changes of less than 0.02 K, which agrees with our calculation.


Stabilities of bias voltage and mechanical damping rates 
Throughout the measurement, the bias voltage $V_b$ is applied to compensate
for the surface potential $V_0$ at each separation. $V_b$ is sourced
from a low-noise source meter (Keithley 2400) connected through
an RC circuit, which serves as a potential divider and low-pass filter
(see Supplementary Information section 3). We measured the noise
spectral density of the source meter and estimated that its contribution
to the noise of $V_b$ reaches the thermal noise level at frequencies
near the membrane resonance (182-194 kHz).

We characterize the mechanical damping rates of the two phonon
modes ($\Delta\gamma_i/\gamma_i$, where $i = 1, 2$) at different separations. The damping rates
remain constant within the measurement error of ±4% in the whole
distance range (Extended Data Fig. 9a). We also measure the damping
rates when the bath temperatures are varied around the setpoints.
With a temperature change of 0.3 K, the damping rates are constant
within the measurement error of 4% (Extended Data Fig. 9b). Using the
estimated bath temperature fluctuations ($\delta T = 0.001$ K) obtained above,
we estimate the temperature-induced fluctuations of mechanical
damping to be less than 0.01%.
Extended Data Fig. 1 | Numerical calculations of the Casimir force and its
heat transfer effect. a, Cross-section of the layered structure used in the
experiment. b, Calculated correction factor, $\eta$, plotted against distance, $d$. c,
Calculated coupling rate, $g_c$, plotted against $d$. d, Calculated mode
temperatures, $T_1'$ and $T_2'$, plotted against $d$ on the basis of experimental
condition 1 in Extended Data Fig. 2a. e, Ratio between the Casimir pressures
contributed from thermal fluctuations ($F_\mathrm{th}$) and quantum vacuum fluctuations
($F_\mathrm{Cas}$) plotted against $d$.
Extended Data Fig. 2 | Additional experimental results obtained from
different samples and conditions. a, Summary of different experimental
conditions used. Condition 1 corresponds to the experimental results
presented in the main text. b-f, Measurement results obtained using
conditions 2 and 3. In all cases, phonon mode splitting is examined and
confirms that the Casimir force is dominant over the distance range concerned.
b, Resonance frequencies versus bath temperature for sample set B (conditions
2 and 3). c, d, Mode temperatures as functions of distance under different
resonance-matching conditions. Error bars represent the standard error
obtained from three hours of continuous measurement. e, f, Heat flux
transferred across thermal baths as a function of distance. The error bars
originate from error propagation in the calculation.
Extended Data Fig. 3 | Electrostatic calibration of the absolute distance between membranes. a, b, Dependence of electrostatic strength (a) and minimum
splitting voltage $V_0$ (b) on the distance between membranes. In b, the error bars represent the error of the parabolic fit to the frequency splitting versus voltage.
shifts of the two modes with the contribution from the Casimir force excluded.
Extended Data Fig. 5 | Device fabrication. a-h, Fabrication process flow for the left (a-d) and right (e-h) samples. i, j, The left (i) and right (j) samples are attached
to a custom-made copper plate and a printed circuit board, respectively.
Extended Data Fig. 6 | Parallel alignment of the membranes. a, Schematic
showing the parallel alignment setup. DAQ, data acquisition system.
b, c, Transmission optical images of aligned (b) and misaligned (c) membranes.
d, e, Optical intensity at different locations on the membranes (marked in b, c)
as a function of the change in separation. Solid lines are sinusoidal fits with an
attenuation factor. The periodicity of around 230 nm matches well with the
half-wavelength of the illumination light (460 nm). From the fitting, we find
that the angle misalignments along the x and y directions are $\Delta\theta_x = 22 \pm 25$ µrad
and $\Delta\theta_y = 43 \pm 24$ µrad for aligned membranes (d), and $\Delta\theta_x = 228 \pm 33$ µrad and
$\Delta\theta_y = 179 \pm 39$ µrad for misaligned membranes (e).
Extended Data Fig. 7 | Experimental setup. a, Schematic showing the experimental setup. APD, avalanche photodetector; BS, beam splitter; DC PD, DC
photodetector; L, lens; M, mirror; ND, neutral density filter. b, Optical image of the sample mount assembly and control stages.
control. The shaded areas represent the linewidths of the mechanical modes.
Extended Data Fig. 9 | Characterization of mechanical damping rate. a, b, Relative change in damping rate ($\Delta\gamma_i/\gamma_i$) plotted against distance (a) and temperature
change (b). Error bars represent the standard deviation of 100 measurements.
The macroscopic electromagnetic boundary conditions, which have been established
for over a century, are essential for the understanding of photonics at macroscopic
length scales. Even state-of-the-art nanoplasmonic studies, exemplars of extremely
interface-localized fields, rely on their validity. This classical description, however,
neglects the intrinsic electronic length scales (of the order of ångström) associated
with interfaces, leading to considerable discrepancies between classical predictions
and experimental observations in systems with deeply nanoscale feature sizes, which
are typically evident below about 10 to 20 nanometres. The onset of these
discrepancies has a mesoscopic character: it lies between the granular microscopic
(electronic-scale) and continuous macroscopic (wavelength-scale) domains. Existing
top-down phenomenological approaches deal only with individual aspects of these
omissions, such as nonlocality and local-response spill-out. Alternatively,
bottom-up first-principles approaches (for example, time-dependent density
functional theory) are severely constrained by computational demands and thus
become impractical for multiscale problems. Consequently, a general and unified
framework for nanoscale electromagnetism remains absent. Here we introduce and
experimentally demonstrate such a framework, amenable to both analytics and
numerics and applicable to multiscale problems, that reintroduces the electronic
length scale via surface-response functions known as Feibelman $d$ parameters. We
establish an experimental procedure to measure these complex dispersive surface-
response functions, using quasi-normal-mode perturbation theory and observations
of pronounced nonclassical effects. We observe nonclassical spectral shifts in excess
of 30 per cent and the breakdown of Kreibig-like broadening in a quintessential
multiscale architecture: film-coupled nanoresonators, with feature sizes comparable
to both the wavelength and the electronic length scale. Our results provide a general
framework for modelling and understanding nanoscale (that is, all relevant length
scales above about 1 nanometre) electromagnetic phenomena.


We reintroduce the electronic length scale by amending the classical
boundary conditions (Fig. 1a) with a set of mesoscopic complex surface-response
functions, known as the Feibelman $d_\perp$ and $d_\parallel$ parameters
(Fig. 1b), which play a role analogous to the local bulk permittivity,
but for interfaces between two materials. $d_\perp$ and $d_\parallel$ are equal to the
frequency-dependent centroids of the induced charge and the normal
derivative of the tangential current, respectively, at an equivalent
planar interface (Fig. 1c and Supplementary Information section S1).
They enable a leading-order-accurate incorporation of nonlocality,
spill-out and surface-enabled Landau damping (tunnelling and size
quantization, which are not incorporated in the $d$ parameters, are
non-negligible at feature sizes below about 1 nm; see Supplementary
Information section S2.C).


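We start by summarizing the key elements of our framework. The $d$
parameters drive an effective nonclassical surface polarization
$\mathbf{P}_s = \boldsymbol{\pi} + \mathrm{i}\omega^{-1}\mathbf{K}$ (Fig. 1d, Supplementary Information section S2.A),
with $d_\perp$ contributing an out-of-plane surface dipole density of
$\boldsymbol{\pi} = d_\perp\varepsilon_0[\![E_\perp]\!]\hat{\mathbf{n}}$ (with magnitude $\pi = \hat{\mathbf{n}}\cdot\boldsymbol{\pi}$; $\varepsilon_0$ is the vacuum permittivity) and $d_\parallel$ contributing an
in-plane surface current density of $\mathbf{K} = \mathrm{i}\omega d_\parallel[\![\mathbf{D}_\parallel]\!]$. Here, $[\![\mathbf{f}]\!] = \mathbf{f}^+ - \mathbf{f}^-$
denotes the discontinuity of a field $\mathbf{f}$ (an electric $\mathbf{E}$, magnetic $\mathbf{B}$, displacement
$\mathbf{D}$ or magnetizing $\mathbf{H}$ field oscillating at frequency $\omega$) across an
interface $\partial\Omega$ with outward normal $\hat{\mathbf{n}}$ (the superscripts '+' and '-'
denote points immediately outside and inside the interface, that is,
$\mathbf{f}^\pm = \mathbf{f}(\mathbf{r} \pm 0^+\hat{\mathbf{n}})$, where $\mathbf{r} \in \partial\Omega$). Similarly, $f_\perp = \hat{\mathbf{n}}\cdot\mathbf{f}$ and $\mathbf{f}_\parallel = (\hat{\mathbf{I}} - \hat{\mathbf{n}}\hat{\mathbf{n}}^\mathrm{T})\mathbf{f}$ denote
the (scalar) perpendicular and (vectorial) parallel components of $\mathbf{f}$
relative to $\partial\Omega$. These surface terms can be equivalently incorporated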
Fig. 1 | Framework, experimental structure and measured nonclassical
shifts. a, b, Classical (a) and mesoscopic (b) electromagnetic boundary
conditions. c, Equilibrium and induced densities, $n(\mathbf{r})$ and $\rho(\mathbf{r}, \omega)$ (not to scale),
at a jellium-vacuum interface (Wigner-Seitz radius, $r_s = 3.93$ atomic units;
$\hbar\omega = 1$ eV; $\hbar$, reduced Planck constant) computed from (time-dependent)
density functional theory. $d_\perp$ is the centroid of the induced charge.
d, Nonclassical corrections can be formulated as self-consistent surface
polarizations, representing the effective surface dipole density $\pi(\mathbf{r})$ and the
current density $\mathbf{K}(\mathbf{r})$. e, Experimental structure: film-coupled Au nanodisks on a

as a set of mesoscopic boundary conditions (here without external 
interface currents or charges) for the conventional macroscopic Max- 
well equations (also shown in Fig. 1b; see Supplementary Information 
section S2.B) 


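$$[\![D_\perp]\!] = -\mathrm{i}\omega^{-1}\nabla_\parallel\cdot\mathbf{K} = d_\parallel\nabla_\parallel\cdot[\![\mathbf{D}_\parallel]\!] \tag{1a}$$
$$[\![B_\perp]\!] = 0 \tag{1b}$$
$$[\![\mathbf{E}_\parallel]\!] = -\varepsilon_0^{-1}\nabla_\parallel\pi = -d_\perp\nabla_\parallel[\![E_\perp]\!] \tag{1c}$$
$$[\![\mathbf{H}_\parallel]\!] = \mathbf{K}\times\hat{\mathbf{n}} = \mathrm{i}\omega d_\parallel[\![\mathbf{D}_\parallel]\!]\times\hat{\mathbf{n}} \tag{1d}$$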


These mesoscopic boundary conditions are a twofold generalization
from opposite directions. First, they generalize the usual macroscopic
electromagnetic boundary conditions ($[\![D_\perp]\!] = [\![B_\perp]\!] = 0$
and $[\![\mathbf{E}_\parallel]\!] = [\![\mathbf{H}_\parallel]\!] = 0$), to which they reduce in the limit $d_\perp = d_\parallel = 0$. Secondly,
they represent a conceptual and practical generalization of the applicability
of the Feibelman $d$ parameters, elevated from their original
purview of planar and spherical interfaces, and beyond recent quasistatic
considerations, to a fully general electrodynamic framework.
Au-Ti-Si substrate, separated by a nanoscale AlO$_x$ gap $g$ (Si and Al nanodisks
were also studied). f, Surface dipole density $\pi(\mathbf{r})$ of the (1, 1) gap plasmon of a
film-coupled Au nanodisk ($D = 63$ nm, $g = 4$ nm). g, Observation of large
nonclassical corrections (spectral shift ≳ 400 nm) in film-coupled Au
nanodisks ($D = 63$ nm). Measured frequencies (circles) of the (1, 1) resonance
blueshift relative to the classical prediction (dashed line), which quantitatively
agree with nonclassical calculations (solid line, and the overlain calculated
intensity map of the scattering efficiency $\sigma_\mathrm{sca}/A$, where $A = \pi D^2/4$).


Experimentally, we establish a systematic approach to measure
the $d$-parameter dispersion of a general two-material interface and
illustrate it using Au-AlO$_x$ interfaces. Whereas the $d$ parameters of
simple metals can be accurately computed within jellium time-dependent
density functional theory, those of noble metals, such as Au,
require time-dependent density functional theory beyond the jellium
approximation owing to non-negligible screening from lower-lying
orbitals. We show that the $d$ parameters can instead be measured
experimentally. By developing and exploiting a quasi-normal-mode
(QNM)-based perturbation expression, we translate these
mesoscopic quantities directly into observables (spectral shifting
and broadening) and measure them in designed plasmonic systems
that exhibit pronounced nonclassical corrections. Our experimental
testbed enables a direct procedure to extract the $d$ parameters from
standard dark-field spectroscopic measurements, in a manner analogous
to ellipsometric measurements of the local bulk permittivity.
Moreover, by investigating a complementary hybrid plasmonic setup,
we discover and experimentally demonstrate design principles for
structures that are classically robust (that is, they exhibit minimal
nonclassical corrections) even under nanoscopic conditions.

The extensive interest in film-coupled nanoresonators, which
combine wavelength-scale resonators with nanometric gaps, is 
a particularly pertinent example that underscores the need for 
Fig. 2 | Measurement setup and sample micrographs. a, Tabletop dark-field scattering setup, where dark-field scattering from nanodisks illuminated by a broadband lamp is collected by a camera and a spectrometer (for details see Methods and Supplementary Information section S10). b, Dark-field optical micrograph of a Au nanodisk array (scale bar, 2 μm). c, Scanning electron microscopy image of a single Au nanodisk (scale bar, 40 nm). d, Cross-sectional transmission electron microscopy image of an AlOₓ gap (scale bar, 10 nm).
Fig. 3 | Systematic measurement of the complex surface-response function $d_\perp(\omega)$ of the Au-AlOₓ interface. a, Nonclassical perturbation strengths, calculated from QNM-based perturbation theory (equation (6)) in a film-coupled Au nanodisk (see inset of schematic in c, D = 70.4 nm); $\kappa_\perp^{\text{Au-AlO}_x}$ is dominant. b, c, Measured (markers) dispersion of $\mathrm{Re}(d_\perp^{\text{Au-AlO}_x})$ (b) and $\mathrm{Im}(d_\perp^{\text{Au-AlO}_x})$ (c) and their linear fits (lines). Gap sizes g are distinguished by colour, and diameters (D₁ ≈ 82.9 nm, D₂ ≈ 70.4 nm and D₃ ≈ 63.0 nm) decrease to the right. Inset to c, sample structure schematic. d, Measured thickness-dependent refractive indices of bare AlOₓ films grown on Au for diameters D₁,

multiscale electrodynamic tools that incorporate nonclassical effects. We designed and fabricated film-coupled nanodisks (Figs. 1e, 2b-d) of various materials to verify our framework and directly measure the d parameters. Specifically, an optically thick Au film (atop a Si substrate) was separated from lithographically defined Au, Si or Al nanodisks (of diameter D) by a nanoscale AlOₓ spacer, deposited by atomic layer deposition (ALD; see Supplementary Information section S5), demarcating a film-nanodisk gap of thickness g. The nanodisks support localized gap-plasmon resonances, which are (p, q) integer-indexable according to their field variations in the azimuthal and radial directions, respectively. The fundamental mode (1, 1) is optically accessible in the far field and exhibits highly confined electromagnetic fields within the gap, suggesting potentially large nonclassical corrections.

We implemented the mesoscopic boundary conditions (Fig. 1b) in a standard full-wave numerical solver (COMSOL Multiphysics, implementation available at https://github.com/yiy-mit/nanoEM; see Supplementary Information section S3). With specified d parameters, this permits self-consistent calculations of, for example, the nonclassical surface dipole density $\pi(\mathbf{r})$, as shown in Fig. 1f for the (1, 1) mode. Similarly, conventional electromagnetic quantities such as the scattering efficiency $\sigma_{\mathrm{sca}}/A$ ($A = \pi D^2/4$) can be computed, enabling comparison with experimental results (Fig. 1g). For Au disks, the (1, 1) resonance is consistently blueshifted relative to the classical prediction, with shifts exceeding 30% for the smallest considered gaps.
D₂ and D₃. e-j, Scattering efficiency (e, g, i) across distinct diameters and gap sizes and extracted complex (1, 1) resonance eigenfrequencies (markers; f, h, j). Whereas classical predictions (brown dashed lines) deviate considerably from the observations, our nonclassical calculations (black solid lines), employing the aforementioned linear $d_\perp^{\text{Au-AlO}_x}(\omega)$ fit, are in quantitative agreement across all diameters. Shadings indicate fit-derived confidence intervals for our calculations: 2σ ≈ 95% for $\mathrm{Re}(d_\perp^{\text{Au-AlO}_x})$ and $\mathrm{Re}(\tilde\omega)$ (b, f, h, j) and 1σ ≈ 68% for $\mathrm{Im}(d_\perp^{\text{Au-AlO}_x})$ and $\mathrm{Im}(\tilde\omega)$ (c, f, h, j). a.u., arbitrary units. In e-j, the diameter D of the Au sample used is shown above the panels.


To extract the surface-response functions from observables, we develop a perturbative description of the nonclassical spectral shift using the solver QNMEig of the QNM freeware MAN (Modal Analysis of Nanoresonators), with retardation incorporated explicitly: the true eigenfrequency $\tilde\omega = \tilde\omega^{(0)} + \tilde\omega^{(1)} + \ldots$ (eigenindex implicit) has a first-order nonclassical correction $\tilde\omega^{(1)}$ to its classical value $\tilde\omega^{(0)}$ (Supplementary Information section S4)

$$\tilde\omega^{(1)} = \tilde\omega^{(0)}\sum_\tau\left(\kappa_\perp^\tau d_\perp^\tau + \kappa_\parallel^\tau d_\parallel^\tau\right) \tag{2}$$

with mode-, shape- and scale-dependent nonclassical perturbation strengths (units of inverse length)


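$$\kappa_\perp^\tau = -\int_{\partial\Omega_\tau} D_\perp^{(0)}\,[\![E_\perp^{(0)}]\!]\,\mathrm{d}^2\mathbf{r}, \qquad \kappa_\parallel^\tau = \int_{\partial\Omega_\tau} \mathbf{E}_\parallel^{(0)}\cdot[\![\mathbf{D}_\parallel^{(0)}]\!]\,\mathrm{d}^2\mathbf{r} \tag{3}$$

Here, $\tau$ runs over all material interfaces such that $\bigcup_\tau \partial\Omega_\tau = \partial\Omega$, that is, $\tau \in \{\text{Au-AlO}_x, \text{Au-air}\}$ for our setup, and $\mathbf{D}^{(0)}$ and $\mathbf{E}^{(0)}$ denote the D and E fields of the (suitably normalized; see Supplementary Information section S4.A) classical QNM under consideration. Conceptually, equation (3) states that the nonclassical perturbation strength is proportional to the difference between the classical surface energy densities evaluated on either side of the interface.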

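Figure 3a shows the magnitudes of the nonclassical perturbation strengths in a film-coupled Au nanodisk: $\kappa_\perp^{\text{Au-AlO}_x}$, $\kappa_\parallel^{\text{Au-AlO}_x}$, $\kappa_\perp^{\text{Au-air}}$ and $\kappa_\parallel^{\text{Au-air}}$. Evidently, $\kappa_\perp^\tau$ far exceeds $\kappa_\parallel^\tau$ for all gap sizes of interest, rendering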
Fig. 4 | Robustness to nonclassical corrections. a, The nonclassical perturbation strength is one order of magnitude smaller in the hybrid Si-Au system than in its Au-Au counterpart. The Si and Au nanodisk diameters are chosen to ensure spectral alignment of the (1, 1) resonance at every gap size (spanning D ∈ [80 nm, 160 nm] and D ∈ [15 nm, 40 nm], respectively). b-d, Observation of robust optical response in a Si-Au setup with the

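the impact of $d_\parallel^\tau$ negligible. Similarly, the impact of $d_\perp^{\text{Au-air}}$ is negligible relative to $d_\perp^{\text{Au-AlO}_x}$ because $\kappa_\perp^{\text{Au-AlO}_x} \gg \kappa_\perp^{\text{Au-air}}$ for small g, and more generally because $d_\perp^{\text{Au-AlO}_x} > d_\perp^{\text{Au-air}}$ owing to screening from AlOₓ (Supplementary Information section S16). Jointly, this justifies the approximation

$$\tilde\omega^{(1)} \approx \tilde\omega^{(0)}\,\kappa_\perp^{\text{Au-AlO}_x}\,d_\perp^{\text{Au-AlO}_x} \tag{4}$$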


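Inversion of equation (4) enables the direct experimental inference of $d_\perp^{\text{Au-AlO}_x}$ given the measured $\tilde\omega$ and calculated $\tilde\omega^{(0)}$ (because, to first order, $\tilde\omega^{(1)} \approx \tilde\omega - \tilde\omega^{(0)}$). We note that $|\mathrm{Re}(\kappa_\perp^{\text{Au-AlO}_x})| \gg |\mathrm{Im}(\kappa_\perp^{\text{Au-AlO}_x})|$ (by 1-2 orders of magnitude) for the considered gap sizes; consequently, $\mathrm{Re}(d_\perp^{\text{Au-AlO}_x})$ contributes to spectral shifting and $\mathrm{Im}(d_\perp^{\text{Au-AlO}_x})$ to broadening.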
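
The inversion of equation (4) amounts to one complex division. The following minimal sketch illustrates it in Python; the function name `infer_d_perp` and all numerical values are hypothetical, chosen only to show the arithmetic, and are not taken from the measurements. The perturbation strength is assumed to be known from a separate classical QNM calculation.

```python
def infer_d_perp(omega_measured, omega_classical, kappa_perp):
    """Invert equation (4) for the complex d parameter.

    The result has the inverse unit of kappa_perp (nm if kappa_perp is in 1/nm).
    """
    # To first order, the nonclassical correction equals the
    # measured-minus-classical eigenfrequency difference.
    omega_first_order = omega_measured - omega_classical
    return omega_first_order / (omega_classical * kappa_perp)


# Hypothetical inputs (illustration only, not measured values).
omega_classical = 1.50 - 0.05j   # classical QNM eigenfrequency (eV)
kappa_perp = -0.20 + 0.00j       # perturbation strength (1/nm)
d_true = -0.50 + 0.10j           # d parameter used to synthesise a "measurement" (nm)

# Forward model (equation (4)), followed by the inversion.
omega_measured = omega_classical * (1 + kappa_perp * d_true)
d_inferred = infer_d_perp(omega_measured, omega_classical, kappa_perp)
print(d_inferred)
```

Because the perturbation strength in this sketch is purely real, the real part of the d parameter maps onto the real frequency shift and the imaginary part onto the linewidth change, mirroring the statement above.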
Ensemble scattering spectra of 18 arrays of Au nanodisks (height, 31 nm) were measured by dark-field scattering microscopy (Fig. 2, Methods), spanning three diameters and six gap sizes (Fig. 3e-j). The associated complex eigenfrequencies $\tilde\omega$ were subsequently extracted by Voigt profile deconvolution (Supplementary Information section S12), using the measured particle size distribution (Supplementary Information section S7). For the AlOₓ spacer, we observed ellipsometrically (and include in our calculations) a thickness-dependent refractive index $n_{\mathrm{AlO}_x}$ (Fig. 3d, Supplementary Information section S9), an effect commonly observed in ultrathin ALD-grown AlOₓ layers and other ALD-grown materials.

Figure 3b, c shows the complex surface-response function $d_\perp^{\text{Au-AlO}_x}$ extracted using equation (4). Within the considered spectral range, $\mathrm{Re}(d_\perp^{\text{Au-AlO}_x})$ (Fig. 3b) reveals a nearly dispersionless surface response of comparatively large magnitude, from −0.5 nm to −0.4 nm. By contrast, $\mathrm{Im}(d_\perp^{\text{Au-AlO}_x})$ (Fig. 3c) is strongly dispersive, increasing from <0.1 nm in


detrimental quantum corrections mitigated. The nonclassical calculation for the Si-Au setup assumes $d_\perp^{\text{Au-AlO}_x} = -0.5 + 0.3\mathrm{i}$ nm, a constant extrapolation to higher frequencies from Fig. 3b, c. In d, the measured and calculated spectra are normalized separately. The calculated spectra incorporate inhomogeneous broadening (about 6%) due to disk-size inhomogeneity (Supplementary Information sections S7, S12.B).


the near-infrared to about 0.3 nm in the visible. The thickness dependence of $n_{\mathrm{AlO}_x}$ imparts an attendant implicit dependence on the inferred $d_\perp^{\text{Au-AlO}_x}$ (Supplementary Information section S13). As a result, the frequency fits in Fig. 3b, c convey a composite dependence along the $(\omega, n_{\mathrm{AlO}_x})$ space (Fig. 3d, circles) sampled by our data.

Whereas the negative sign of $\mathrm{Re}(d_\perp^{\text{Au-AlO}_x})$, and the associated blueshift of $\mathrm{Re}(\tilde\omega - \tilde\omega^{(0)})$ (Fig. 3f, h, j, top panels), agrees with earlier observations in Au and Ag nanoparticles, the spectral shift is substantially larger. There are two reasons for this: first, the nonclassical perturbation strength $\kappa_\perp^{\text{Au-AlO}_x}$ is much larger than, for example, in standalone nanospheres or film-coupled nanospheres, owing to the strong field confinement beneath the entire nanodisk footprint (Supplementary Information section S4); second, screening from the AlOₓ cladding expels induced charge into Au, thereby enhancing $d_\perp^{\text{Au-AlO}_x}$ relative to the unscreened interface, that is, relative to $d_\perp^{\text{Au-air}}$ (Supplementary Information section S16).

Nonclassical broadening due to surface-enhanced Landau damping, that is, $\mathrm{Im}(\tilde\omega - \tilde\omega^{(0)})$, is similarly enhanced for the same reasons (Fig. 3f, h, j, lower panels). Classically, the linewidth reduces nearly monotonically with gap size, primarily owing to increased light confinement (reduced radiative coupling). Instead, we observed (and predict, nonclassically) a near-constant broadening that is reduced slightly for very small gaps. This near-constant broadening results from an interplay (equation (4)) among the strong (classical) gap dependence of $\mathrm{Re}(\tilde\omega^{(0)})$, the increase of the nonclassical perturbation strength (Fig. 3a) at smaller g values and the decrease of $\mathrm{Im}(d_\perp^{\text{Au-AlO}_x})$ towards the infrared (Fig. 3c). Strikingly, the smallest gap does not produce the strongest nonclassical broadening (that is, $\mathrm{Im}(\tilde\omega^{(1)})$), in contrast to the natural expectation of monotonically increasing $\mathrm{Im}(\tilde\omega^{(1)})$ with decreasing g. Instead, $\mathrm{Im}(\tilde\omega^{(1)})$ is minimal there, a consequence of the near-vanishing magnitude of the strongly dispersive $\mathrm{Im}(d_\perp^{\text{Au-AlO}_x})$ (Fig. 3c). This behaviour demonstrates the apparent breakdown of the empirical understanding of nonclassical broadening in nanostructures, known as Kreibig damping, which holds that $\mathrm{Im}(\tilde\omega^{(1)})$ is approximately proportional to 1/g.

The observation of large nonclassical corrections in our coupled Au-Au setup frames the question: can nonclassical effects, which are often detrimental, be efficiently mitigated even in nanoscopic settings? To answer by example, we consider a hybrid dielectric-metal design, replacing the Au nanodisks with Si. Such hybrid configurations have been predicted to yield higher radiative efficiency with comparable overall plasmonic response and have two key advantages for mitigating nonclassical effects: first, undoped Si is effectively a purely classical material (that is, $d_\perp \approx 0$) under the jellium approximation, as it lacks free electrons; second, high-index nanoresonators reduce the field intensity at the metal interface while maintaining confinement in the gap region. This hybridization can be exploited to reduce the nonclassical perturbation strength $\kappa_\perp^{\text{Au-AlO}_x}$ by an order of magnitude relative to that in the Au-Au design, as shown in Fig. 4a. Our measurements confirm this prediction: for Si nanodisks with D = 104.4 nm, we observe a high-quality scattering spectrum with a symmetric single-resonance feature for all gap sizes (Fig. 4b). The measured resonance frequencies (Fig. 4c) show only minor deviations from classical predictions, in both real and imaginary parts. Although the inclusion of nonclassical effects improves the experimental agreement, the overall shift remains small and comparable to the uncertainties owing to the intrinsic oxide thickness beneath the Si nanodisk (Supplementary Information section S12.C). By considering a range of nanodisk diameters (Fig. 4d), we reach an identical conclusion, even for the smallest considered gap (about 1.1 nm): classical scattering spectra agree well with measurements, and nonclassical corrections are minor relative to those in the Au-Au system. We found similar robustness across several additional gap sizes and diameters (Supplementary Information section S14).

Equation (2) suggests a complementary strategy for mitigating nonclassical effects: if the sign of $\mathrm{Re}(d_\perp^\tau)$ differs at distinct interfaces $\tau$, the interface summation $\mathrm{Re}(\sum_\tau \kappa_\perp^\tau d_\perp^\tau)$ will partially cancel. Whereas noble metals are known to spill inwards ($\mathrm{Re}(d_\perp) < 0$), simple metals (for example, Al) spill outwards ($\mathrm{Re}(d_\perp) > 0$). We found experimental evidence for such a partial cancellation in a combined noble-simple-metal setup (Al nanodisks on a Au substrate; Supplementary Information section S15).
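
The partial cancellation can be made concrete by evaluating the interface sum of equation (2) for two toy configurations. The sketch below is illustrative only: the function `first_order_shift`, the perturbation strengths and the d parameters are all hypothetical placeholders, with both interfaces assigned the same perturbation strength so that only the sign of the d parameter differs.

```python
def first_order_shift(omega_classical, interfaces):
    """Evaluate equation (2) for a list of interfaces.

    Each interface is a tuple (kappa_perp, d_perp, kappa_par, d_par),
    with kappa in 1/nm and d in nm.
    """
    total = sum(k_perp * d_perp + k_par * d_par
                for k_perp, d_perp, k_par, d_par in interfaces)
    return omega_classical * total


omega_classical = 1.50 + 0.0j  # hypothetical classical eigenfrequency (eV)

# Two interfaces with inward spill-out (Re d < 0), as for a noble-noble pair.
same_sign = [(-0.10, -0.40, 0.0, 0.0), (-0.10, -0.40, 0.0, 0.0)]
# One inward (Re d < 0) and one outward (Re d > 0) interface, as for a
# noble-simple pair.
mixed_sign = [(-0.10, -0.40, 0.0, 0.0), (-0.10, +0.30, 0.0, 0.0)]

shift_same = first_order_shift(omega_classical, same_sign)
shift_mixed = first_order_shift(omega_classical, mixed_sign)
print(abs(shift_same), abs(shift_mixed))
```

In a real structure the two interfaces would generally carry different perturbation strengths, so the degree of cancellation depends on the mode profile as well as on the signs of the d parameters.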

The mesoscopic framework presented here introduces a general approach for incorporating nonclassical effects in electromagnetic problems by a simple generalization of the associated boundary conditions. Our experiments show how to directly measure the nonclassical surface-response functions (the Feibelman d parameters) in general and technologically relevant plasmonic platforms. Our findings establish the Feibelman d parameters as first-class response functions of general utility. This calls for the compilation of databases of d parameters at interfaces of photonic and plasmonic prominence, analogous and complementary to the existing databases of local bulk permittivities.
Methods


Dark-field microscopy 

We built a table-top dark-field microscope (Fig. 2a) switchable between imaging and spectroscopy modes and with a 100-700× variable zoom, to measure $\tilde\omega$ from the optical response of the samples (Fig. 2b). Optical spectra were recorded at full zoom, capturing the scattered light from an ensemble of <100 nanodisks (ensemble, rather than single-particle, measurements are necessary owing to the associated low radiative efficiencies; see Supplementary Information section S10). Mutual coupling between nanodisks in the array is negligible, ensured by a lattice periodicity of 2 μm (in-plane filling factor <1%). This allows an isolated-particle treatment.
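
The quoted bound on the in-plane filling factor can be checked with a short calculation. This sketch assumes a square lattice (the lattice type is an assumption made here; only the 2 μm periodicity is stated) and uses the three nominal Au disk diameters from Fig. 3; the function name `filling_factor` is illustrative.

```python
import math


def filling_factor(diameter_nm, period_nm):
    """Fraction of a square unit cell covered by one circular disk."""
    disk_area = math.pi * (diameter_nm / 2.0) ** 2
    cell_area = period_nm ** 2
    return disk_area / cell_area


period_nm = 2000.0                  # lattice periodicity of 2 um
diameters_nm = (82.9, 70.4, 63.0)   # nominal Au disk diameters from Fig. 3

factors = {d: filling_factor(d, period_nm) for d in diameters_nm}
for d, f in factors.items():
    print(f"D = {d} nm: filling factor = {100 * f:.3f}%")
```

Under these assumptions every diameter gives a filling factor well below 1%, in line with the stated bound.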


Structural characterization 

The size distribution of the nanodisks was characterized systematically to adjust for the impact of inhomogeneous broadening in the measured scattering spectrum from the ensemble (Fig. 2c, Supplementary Information section S7). We measured the AlOₓ gap size g using a variable-angle ultraviolet-visible ellipsometer and confirmed the results through cross-sectional transmission electron microscopy (see Fig. 2d and Supplementary Information section S8), finding good agreement with nominal ALD cycle expectations. A surface roughness of about 0.6 nm (root mean square) was measured for the Au substrate using atomic-force microscopy and was taken as the gap size uncertainty. Owing to the conformal nature of the ALD, such roughness should have negligible influence on the scattering spectra, as we verified by numerical simulations (Supplementary Information section S11). These detailed characterizations eliminate the main sources of geometric uncertainty in the mapping between calculated $\tilde\omega^{(0)}$ and measured $\tilde\omega$, which is a necessity for an accurate evaluation of the nonclassical shift $\tilde\omega - \tilde\omega^{(0)}$.


Data availability 


The data that support the plots within this paper and other findings of 
this study are available from the corresponding authors upon reason- 
able request. 
Limiting the increase of CO₂ in the atmosphere is one of the largest challenges of our generation. Because carbon capture and storage is one of the few viable technologies that can mitigate current CO₂ emissions, much effort is focused on developing solid adsorbents that can efficiently capture CO₂ from flue gases emitted from anthropogenic sources. One class of materials that has attracted considerable interest in this context is metal-organic frameworks (MOFs), in which the careful combination of organic ligands with metal-ion nodes can, in principle, give rise to innumerable structurally and chemically distinct nanoporous MOFs. However, many MOFs that are optimized for the separation of CO₂ from nitrogen do not perform well when using realistic flue gas that contains water, because water competes with CO₂ for the same adsorption sites and thereby causes the materials to lose their selectivity. Although flue gases can be dried, this renders the capture process prohibitively expensive. Here we show that data mining of a computational screening library of over 300,000 MOFs can identify different classes of strong CO₂-binding sites (which we term 'adsorbaphores') that endow MOFs with CO₂/N₂ selectivity that persists in wet flue gases. We subsequently synthesized two water-stable MOFs containing the most hydrophobic adsorbaphore, and found that their carbon-capture performance is not affected by water and outperforms that of some commercial materials. Testing the performance of these MOFs in an industrial setting and consideration of the full capture process, including the targeted CO₂ sink (such as geological storage or serving as a carbon source for the chemical industry), will be necessary to identify the optimal separation material.


Different strategies have been developed to mitigate the negative effects of water on the CO₂/N₂ separation selectivity in MOF materials. For example, some MOFs have open metal sites at which amines can be attached, taking advantage of the specific amine chemistry that is also used in conventional amine scrubbing. A previous screening study investigated whether MOFs could adsorb CO₂ in the presence of water, and the results suggested that such MOFs could be de novo designed. In this work, we develop a systematic strategy for the design and preparation of custom-made MOFs that can capture carbon from wet flue gases. Our design methodology is inspired by the rational design of drug molecules, in which organic molecules that fit well into the binding pocket of a protein are mined from databases of known chemicals. The difference in our case is that the 'drug molecule' is known (that is, CO₂), but the substrate that binds it optimally (the MOF) is not. We therefore generated a library of 325,000 hypothetical MOFs, and screened each material for its CO₂/N₂ selectivity and its CO₂ working capacity. The chemical building blocks used in the generation of these materials are shown in Extended Data Figs. 1 and 2. Figure 1a shows that 8,325 hypothetical materials possess a working capacity for CO₂ greater than 2 mmol g⁻¹ and a CO₂/N₂ selectivity greater than 50, a performance that surpasses that of zeolite 13X under dry conditions.
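
The two screening thresholds can be expressed as a simple filter. The sketch below is not the authors' screening code: it assumes the conventional definitions of adsorption selectivity (ratio of adsorbed-phase to gas-phase mole ratios) and of working capacity (CO₂ uptake at adsorption conditions minus uptake at regeneration conditions), and the three candidate materials and their uptakes are invented placeholders.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    co2_ads: float  # CO2 uptake at adsorption conditions (mmol/g)
    n2_ads: float   # N2 uptake at adsorption conditions (mmol/g)
    co2_des: float  # CO2 uptake at regeneration conditions (mmol/g)


def selectivity(c, y_co2=0.15, y_n2=0.85):
    """CO2/N2 selectivity for a 15:85 CO2/N2 gas mixture (assumed definition)."""
    return (c.co2_ads / c.n2_ads) / (y_co2 / y_n2)


def working_capacity(c):
    """CO2 released between adsorption and regeneration (assumed definition)."""
    return c.co2_ads - c.co2_des


def passes_screen(c, min_capacity=2.0, min_selectivity=50.0):
    """Apply the two thresholds quoted in the text."""
    return working_capacity(c) > min_capacity and selectivity(c) > min_selectivity


# Hypothetical materials (placeholder numbers, not from the library).
library = [
    Candidate("mof_a", co2_ads=3.0, n2_ads=0.20, co2_des=0.5),
    Candidate("mof_b", co2_ads=2.5, n2_ads=0.50, co2_des=0.3),
    Candidate("mof_c", co2_ads=1.5, n2_ads=0.05, co2_des=0.2),
]

selected = [c.name for c in library if passes_screen(c)]
print(selected)
```

Here only the first placeholder satisfies both criteria: the second has sufficient capacity but low selectivity, and the third is selective but has too small a working capacity.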

A key part of drug design is to analyse the optimally binding molecules for a common feature or spatial arrangement of atoms at the binding site, which is known as the pharmacophore. In analogy with this, we coin the term 'adsorbaphore' to describe the common pore shape and chemistry of a binding site in a MOF that provides optimal interactions to preferentially bind to a particular guest molecule, in this case CO₂. From our top-ranked 8,325 materials, we identified
Fig. 1 | Computational screening of MOFs for strong CO₂ adsorption and selectivity. a, Results from the screening of 325,000 hypothetical MOFs under conditions that mimic post-combustion capture (adsorption at ambient temperature and 1 bar with a molar ratio of 15:85 CO₂/N₂ and regeneration at 363 K and 0.1 bar). The materials in the green box were selected for more refined screening and adsorbaphore identification; the colour-coding represents the number of MOFs according to the colour bar on the right. b, The H₂O affinity of the top-performing materials, as characterized by a plot of the Henry coefficients (K_H) for H₂O against those of CO₂. The colours represent the three different adsorbaphores found in the top-performing structures: A1, parallel aromatic rings; A2, metal-oxygen bridges; A3, open metal sites. Some materials have both A1 and A2 sites. c, The adsorbaphore containing parallel aromatic rings (A1), which was discovered using the feature-recognition algorithm.


106,680 such CO₂-binding sites (see Extended Data Fig. 3 for examples). A similarity analysis of these binding sites revealed three main classes of adsorbaphore: A1, two parallel aromatic rings with interatomic spacings of approximately 7 Å (31% of all binding sites); A2, metal-oxygen-metal bridges (32%); and A3, open metal sites (21%) (see Supplementary Information for details). Subsequently, we screened the materials that possessed these adsorbaphores for their affinity for water. Figure 1b
Fig. 2 | The effects of water on different MOFs with the same adsorbaphore. a, b, Simulated adsorption of a ternary mixture of CO₂/N₂/H₂O at 313 K by hypothetical MOFs m8067 (a) and m8071 (b). The partial pressure of CO₂ was held at 0.15 bar as the relative humidity was increased. The N₂ uptake was negligible and is not shown. c, d, Visualization of the water loading at 100% relative humidity in m8067 (c) and m8071 (d). The benzoate groups are represented by grey sticks; water is shown using red and white space-filling atoms. In m8067, the benzoate groups perpendicular to the plane of the figure prevent hydrogen-bond formation across the adsorbaphores.


shows the Henry coefficient for water in these high-performing materials. Analysis of the data shows that the materials with the parallel aromatic rings (A1) have a low Henry coefficient for H₂O, whereas those with metal-oxygen bridges (A2) and open metal sites (A3) tend to have higher Henry coefficients (Fig. 1b). A graphical representation of the different adsorbaphores is presented in Extended Data Fig. 4. Comparison of the binding energies, computed using density functional theory, for the adsorbaphore shown in Fig. 1c indicates a preference for CO₂ (−10.2 kcal mol⁻¹) over N₂ and H₂O by 2.7 and 1.5 kcal mol⁻¹, respectively (see Extended Data Table 1). The parallel aromatic rings provide a near-optimum interaction with all three atoms of CO₂, whereas for H₂O the lack of hydrogen-bonding sites limits its binding energy.

The next step was to identify a subclass of MOFs in our library that contains the preferred adsorbaphore. From an experimental point of view, MOFs with the frz topology, characterized by tetra-carboxylated organic ligands coordinated to one-dimensional metal-oxygen rods, are an attractive starting point. One such example has been synthesized with indium as a metal node, resulting in a structurally stable, non-breathing MOF. In this topology, the metal rods provide an ideal scaffolding to which we can attach our adsorbaphore. By varying the metal ion we have some flexibility to tune the distance between the aromatic rings. Our calculations predict that the ideal adsorbaphore distance of 6.5-7.0 Å, which was determined by adjusting the spacing of the aromatic rings incrementally (Extended Data Fig. 5), can be approached if In(III) is replaced by Al(III) (Extended Data Table 2). In addition, aluminium is an attractive choice because it is an abundant metal and it ensures a strong bond with the carboxylate O-atoms of the ligands; this considerably improves the thermal and hydrolytic stability of a MOF.

We generated a library of 35 isoreticular materials using our MOF-generation algorithm, and from the mixture isotherms we computed the


Fig. 3 | Structural representation and stability of [Al-PMOF] and [Al-PyrMOF]. a, b, Ball-and-stick representation of the structures of [Al-PMOF] (a) and [Al-PyrMOF] (b). The orientation of the tetracarboxylate ligands around the Al(III) rods results in the generation of the three-dimensional non-interpenetrated structures containing the adsorbaphore (red box). Atom colour code: pink, Al; grey, C; blue, N; red, O; pale yellow, H. c, d, Laboratory powder X-ray diffraction patterns of [Al-PMOF] (c) and [Al-PyrMOF] (d). Black, simulated; red, as-synthesized material; blue, acetone-exchanged material; green, activated material; sky blue, activated material immersed in liquid water for 7 days; pink, activated material exposed to a controlled atmosphere of nitric acid vapour for 3 h.


CO₂/N₂ selectivity of the materials in dry and wet flue gases (Extended Data Figs. 6 and 7). Our calculations show that all of our predicted materials maintain an excellent selectivity at low pressures, and in about 75% of these materials the selectivity was not influenced by the presence of water under flue-gas conditions. The concept of an adsorbaphore focuses on the design of an adsorption site that optimizes selectivities at low pressure. At higher partial pressures of water, its adsorption is dominated by the energetics of hydrogen-bond formation. Further analyses showed that, for the materials that maintain a high CO₂ uptake at high humidity, it is the pore shape that frustrates the formation of these hydrogen bonds. This is illustrated in Fig. 2a, b, which compares the effect of water on the CO₂ uptake of two materials that have the same adsorbaphore but different pore structures (hypothetical MOFs m8067 and m8071). Figure 2a shows that m8067 is resistant to H₂O flooding: even at a relative humidity of approximately 85%, we find that H₂O has only a small effect on CO₂ capacity. Conversely, m8071 completely loses its CO₂ capacity at 60% relative humidity (Fig. 2b). In Fig. 2c, d we visualize the hydrogen-bond network that is formed at 100% relative humidity in both materials. For m8071 we see a complete hydrogen-bonding network (Fig. 2d), whereas for m8067 (Fig. 2c) we observe a less extensive network; the benzoate groups that separate the adsorbaphores frustrate the formation of a complete hydrogen-bonding network.

On the basis of these predictions, we synthesized two frz-based MOFs using organic ligands that possess the water-frustrating properties reported above: Al-PMOF (m8066) and Al-PyrMOF (m8067). These MOFs are based on one-dimensional rods of Al(III) linked by TCPP (tetrakis(4-carboxyphenyl)porphyrin) and TBAPy (1,3,6,8-tetrakis(p-benzoic acid)pyrene) ligands, respectively (Fig. 3a, b). Figure 3c, d shows no loss of crystallinity upon activation as well as upon exposure to different harsh conditions, including immersion in water for 7 days.

By identifying adsorbaphores in these hypothetical materials, we assume that our in silico screening method can correctly predict the structure of a MOF, its adsorption properties, and the nature of the binding sites of CO₂ and H₂O. We are able to test these assumptions for Al-PMOF and Al-PyrMOF. In Fig. 4a we show that the experimental and predicted CO₂ and N₂ adsorption isotherms are in good agreement. The CO₂ binding positions in the adsorbaphore and the effect of H₂O are more challenging to observe experimentally. The siting of CO₂ was studied using in situ CO₂-loading powder X-ray diffraction. Upon loading, we observed a considerable change in the intensity and peak position of the Bragg reflections (see Supplementary Fig. 2.1). Subsequent Rietveld refinement and Fourier analysis revealed the preferred locations of CO₂ in the pores of Al-PMOF, as shown in Fig. 4b. These results confirm that CO₂ preferentially adsorbs in the adsorbaphore.

The effect of water on the siting of CO₂ has been further addressed by solid-state nuclear magnetic resonance (NMR) analysis. Under magic-angle spinning (MAS), high-resolution ¹³C NMR chemical shifts are very sensitive to changes in the chemical environment. The ¹³C NMR spectra of Al-PyrMOF and Al-PMOF are provided in Extended Data Fig. 8, which also shows the assignment of the peaks to specific atoms on the MOF. The chemical shifts associated with the atoms of the adsorbaphore (inset) are shown in Fig. 4c as a function of the water concentration. At low levels the adsorbaphore atoms experience no change in chemical environment with water loading, whereas at the highest water loadings there are modest changes in the ¹³C chemical shifts of only those atoms that are close to the aluminium-coordinated carboxylate groups next to the adsorbaphore (carbons B and F in Fig. 4c). This broadening is consistent with dipolar broadening from proximate water molecules, thus confirming that the adsorbaphore itself is not a preferential adsorption site for H₂O.

Our simulations predict that CO₂ adsorbed in the adsorbaphore is insulated from the adsorption of water. Because the ¹³C NMR spectrum of adsorbed ¹³CO₂ is extremely sensitive to the proximity of water molecules in terms of chemical shift and line broadening, any disruption of the chemical environment of adsorbed CO₂ by water should be immediately apparent. Figure 4d shows that the chemical shift of the adsorbed ¹³CO₂ is independent of water content, although a broadening of the ¹³C NMR peak is observed with increasing humidity. If this broadening is due to the proximity of the protons in water, it should disappear if the experiments are repeated using D₂O; however, it does not (Fig. 4d). This observation corroborates our simulation results (Fig. 2c), confirming that water has only a limited effect on CO₂ adsorption in Al-PyrMOF.

The ability of these materials to capture CO₂ from wet flue gases is of considerable practical importance. We therefore used a breakthrough experiment to determine the capture capacity of both Al-PMOF and Al-PyrMOF for a mixture of CO₂/N₂ under dry and humid conditions (Fig. 4e). These results confirm the predictions of the simulations (Extended Data Fig. 7): humidity in the flue gases has only a minimal influence on the capture capacity of Al-PMOF, whereas for Al-PyrMOF the capture capacity is in fact enhanced. Furthermore, repeated cycling (Fig. 4f) does not result in degradation of the material or a change in separation performance. It is instructive to compare the performance of our materials with that of a set of reference materials, including those that are commercially available (such as zeolite 13X and activated carbon) and a water-stable, amino-functionalized MOF, UiO-66-NH₂. The capture capacity of these reference materials lies between that of Al-PyrMOF and Al-PMOF in dry flue gases; however, unlike our MOFs, their performance reduces considerably in humid flue gases. Although our materials do not have the highest reported
Fig. 4 | CO₂ adsorption, ¹³C cross-polarization MAS NMR and breakthrough experiments for [Al-PMOF] and [Al-PyrMOF]. a, Experimental (filled) and computational (open) single-component adsorption isotherms for CO₂ (squares) and N₂ (circles) adsorption collected on activated [Al-PMOF] (red) and [Al-PyrMOF] (blue) at 313 K. b, Rietveld refinement of the X-ray diffraction data (Supplementary Information) revealed that CO₂ binding in [Al-PMOF] occurs between the porphyrin cores, that is, in the adsorbaphore. c, The linewidth of each carbon peak of the TBAPy ligand in [Al-PyrMOF] in the ¹³C cross-polarization MAS spectrum, plotted as a function of relative humidity. Each carbon atom in the ligand is labelled in the inset. d, Linewidths extracted from the ¹³C static NMR spectra of ¹³CO₂ loaded in [Al-PyrMOF], plotted against relative humidity (RH). e, CO₂ capture capacity profiles for [Al-PyrMOF] and [Al-PMOF] during breakthrough experiments under dry and humid (85% relative humidity) conditions, with 85/15 v/v of N₂/CO₂ (313 K and 1 bar). f, Benchmarking the CO₂ working capacity of [Al-PyrMOF] and [Al-PMOF] against UiO-66-NH₂, activated carbon and zeolite 13X under dry and humid (85% relative humidity) conditions, with 85/15 v/v of N₂/CO₂ (313 K and 1 bar). For wet flue gases, we studied the performance stability after 3 cycles for reference materials, and after 10 cycles for [Al-PyrMOF] and [Al-PMOF].


working capacities, it is encouraging to see that, in wet flue gases, 
Al-PMOF outperforms commercial materials such as zeolite 13X and 
activated carbon. 

Large-scale screening of databases of hypothetical MOFs for various 
gas separation and storage applications has been reported previously; 
however, here we have focused on identifying binding pockets, or 
structural motifs termed adsorbaphores, as synthetic targets, rather 
than whole materials. This enhances the synthetic viability 
of the approach, as demonstrated by the identification of one new material 
with the targeted adsorbaphore that was synthesized and shown 
to adsorb CO₂ as predicted. The concept of linking computational 
screening with the synthesis of the corresponding materials through 
such adsorbaphores should be applicable to other gas separations of 
increasing complexity. 
The computed data and hypothetical materials that were used in this 
Article are provided free of charge on the Materials Cloud 
(https://doi.org/10.24435/materialscloud:2018.0016/v3). An interactive version 
of Fig. 1a, b can also be found on this site. Data that are not included in 
the paper are available upon reasonable request to the corresponding 
authors. 


Code availability 


Topology Based Crystal Constructor (ToBasCCo), the Python program 
used to build hypothetical MOFs, is hosted on GitHub at 
https://github.com/peteboyd/tobascco. The Python code that compares common 
chemical features between fragments is also provided on GitHub at 
https://github.com/peteboyd/adsorbaphore and depends on a 
C library called MCQD, which performs the maximum clique detection 
of the chemical graphs. An interface between Python and C for 
this library is provided at https://github.com/peteboyd/mcqd_api. The 
Automatic Binding Site Locator (ABSL) program, used to identify CO₂ 
binding sites in each MOF, is part of a broader Python-based code used 
to facilitate simulations of porous materials called Fully Automated 
Adsorption Analysis in Porous Solids (FA³PS). This is available on 
BitBucket at https://bitbucket.org/tdaff/automation. 
Extended Data Fig. 1 | Hypothetical material generation (1). The organic secondary building units (SBUs) used in the generation of the hypothetical MOF 
database. 
Extended Data Fig. 3 | Examples of adsorbaphores. A selection of the most 
representative adsorbaphores obtained from visual inspection of the top 50 
most frequent adsorbaphores found from the random pairing method 
described in the Supplementary Information. There are three major trends in 
the molecular fragments, labelled A1, A2 and A3, which can be observed upon 
descending each column. The number of chemical features of the fragments 
increases from left to right across each row. This is accomplished by increasing 
the minimum number of common atoms allowed during the substructure 
search, called the minimum clique threshold (MCT). Pictured in each 
adsorbaphore is a representative contour map of the energy produced from 
CO₂ binding with the adsorbaphore atoms from each original CO₂-binding site. 
a-c, A1, planar aromatic systems, in which CO₂ binds in between the stacked 
rings; d-f, A2, CO₂ binds near the bridging oxygen of a pillared vanadium SBU; 
g-i, A3, CO₂ binds between open-metal Cr SBUs. 
found adsorbaphores from the maximum clique detection method; A1 (a); A2 
(b) and A3 (c). Atom colours: grey, carbon; red, oxygen; green, vanadium; blue, 

MOFs from the hypothetical database. The adsorbaphore atoms are 
highlighted in green. 
Extended Data Fig. 5 | Effect of the adsorbaphore spacer. Left, plot of the CO₂ 
adsorption at 0.15 bar and 313 K in hypothetical MOFs with the frz topology, 
metal node number 8 and organic linker number 67 (m8067). The interplanar 
spacing of the adsorbaphore atoms is adjusted by reassembling the structure 
with longer or shorter Al-O bonds. Right, representation of the spacing 
adjustments made to the material. 


Extended Data Fig. 6 | Hypothetical MOFs built with the frz topology. These 
structures contain an Al one-dimensional rod (m8) and the organic ligands 60-82 
from Extended Data Figs. 1 and 2. The spacing between parallel aromatic 
cores (seen in the centre of each hypothetical MOF) is around 6.7 Å, defining 
the adsorbaphore site in each material. No functional groups were used to 
decorate these materials. We refer to the synthesized versions of m8066 and 
m8067 as Al-PMOF (Al₂(OH)₂(H₂TCPP)) and Al-PyrMOF (Al₂(OH)₂(TBAPy)), 
respectively. 
Extended Data Fig. 7 | CO₂ adsorption capacity of a class of frz-based 
hypothetical MOFs at 0.15 bar and 313 K under ‘wet’ (85% relative humidity) 
and ‘dry’ flue gas conditions. Top, middle, hypothetical MOFs in which the 
organic ligand is connected to the Al ion via 4 benzoate moieties; bottom, 
hypothetical MOFs in which the organic ligand is connected to the Al metal ion 
via 4 acetylenic carboxylate moieties. The materials are ranked from lowest 
adsorbaphore density to highest, and the number on the x axis corresponds to 
the organic linker number (YY) in m8oYY (see Extended Data Fig. 6). 
Extended Data Fig. 8 | NMR spectra. a, b, ¹³C cross-polarization MAS spectrum 
of Al-PMOF (a) and Al-PyrMOF (b) recorded at 9.39 T with sample spinning at 8 
kHz; the contact time for the cross-polarization experiment was 2 ms. The 
letters labelling the peaks of the spectra correspond to the labels on the carbon 
atoms in the structures on the left. 
Extended Data Table 1 | DFT binding energies of gas particles in the adsorbaphore pocket of each MOF synthesized in this 
work 

Adsorbaphore binding energy (kcal/mol) 

MOF          CO₂      H₂O      N₂ 
Al-PMOF      -12.6    -10.4    -7.9 
Al-PyrMOF    -10.2    -8.7     -7.5 


Extended Data Table 2 | DFT optimized cell parameters of 
hypothetical MOFs 

Metal species    a (Å)    b (Å)    c (Å) 
Al(III)          15.84    30.33    6.65 
Fe(III)          15.96    30.47    6.83 
Ga(III)          15.97    30.56    6.77 
In(III)          16.23    30.88    7.27 
Sc(III)          16.12    30.70    7.34 
Y(III)           16.43    30.72    7.70 

The MOFs are built with the frz topology and organic linker number 67 (m8067) and various 
trivalent metal species. The c axis corresponds to the spacing between aromatic rings in the 
adsorbaphore. 
Every night across the world’s oceans, numerous marine animals arrive at the surface 
of the ocean to feed on plankton after an upward migration of hundreds of metres. 
Just before sunrise, this migration is reversed and the animals return to their daytime 
residence in the dark mesopelagic zone (at a depth of 200-1,000 m). This daily 
excursion, referred to as diel vertical migration (DVM), is thought of primarily as an 
adaptation to avoid visual predators in the sunlit surface layer and was first recorded 
using ship-net hauls nearly 200 years ago. Nowadays, DVMs are routinely recorded by 
ship-mounted acoustic systems (for example, acoustic Doppler current profilers). 
These data show that night-time arrival and departure times are highly conserved 
across ocean regions and that daytime descent depths increase with water clarity, 
indicating that animals have faster swimming speeds in clearer waters. However, 
after decades of acoustic measurements, vast ocean areas remain unsampled and 
places for which data are available typically provide information for only a few 
months, resulting in an incomplete understanding of DVMs. Addressing this issue is 
important, because DVMs have a crucial role in global ocean biogeochemistry. 
Night-time feeding at the surface and daytime metabolism of this food at depth provide an 
efficient pathway for carbon and nutrient export. Here we use observations from a 
satellite-mounted light-detection-and-ranging (lidar) instrument to describe global 
distributions of an optical signal from DVM animals that arrive in the surface ocean at 
night. Our findings reveal that these animals generally constitute a greater fraction of 
total plankton abundance in the clear subtropical gyres, consistent with the idea that 
the avoidance of visual predators is an important life strategy in these regions. Total 
DVM biomass, on the other hand, is higher in more productive regions in which the 
availability of food is increased. Furthermore, the 10-year satellite record reveals 
significant temporal trends in DVM biomass and correlated variations in DVM biomass 
and surface productivity. These results provide a detailed view of DVM activities 
globally and a path for refining the quantification of their biogeochemical 
importance. 


For decades, airplane-mounted lidar instruments have used the backscattering 
of light (bbp) from phytoplankton, zooplankton and small 
fish to locally map the distribution of these organisms in the water 
column. In contrast to passive ocean sensors that measure reflected 
sunlight, lidar uses lasers as a light source and thus has the capacity 
to measure marine organisms both during the day and at night. When 
DVM animals are prominent, their nocturnal invasion of the surface 
ocean is expected to increase bbp at night compared with what the 
bbp would be in the absence of these animals. Five ship-based examples 
of this DVM signature are shown in Fig. 1a, in which the animal 
signals contribute 7-28% of total night-time bbp. These signals appear 
as large spikes in this record because DVM animals are ‘bright’ targets 
compared to the phytoplankton and other suspended particles 
that constitute the much lower bbp baseline (Fig. 1a); however, 
animals are also rarer and only occasionally pass through the 
small sample volume (around 2 ml) of the ship’s bbp instrument. 
By contrast, a lidar with a much larger sampling volume can 
effectively capture the signal of all scattering components with every 
measurement. The satellite cloud-aerosol lidar with orthogonal 
polarization (CALIOP) sensor has been conducting such measurements 
for more than a decade (Methods) and its bbp retrievals provide 
an opportunity to decipher global patterns in vertically migrating 
animals. 

Fig. 1 | Marine animal and phytoplankton influences on day-to-night 
changes in particulate backscattering coefficients and the biomass-normalized 
difference ratio. a, Ship-based time-series of bbp showing 
nocturnal spikes associated with DVM animals and the general lack of these 
spikes during the day (Methods). Black line, complete bbp record. Purple line, 
despiked baseline bbp record. b, Typical diel cycle in bbp of phytoplankton 
(Methods). Red dotted line, daily equator crossing times of CALIOP. a, b, Black 
and white bars on the x axis indicate night and day, respectively. c, Biomass-normalized 
difference ratios (Δbbp) for ranges in DVM (x axis) and 
phytoplankton (y axis) contributions to day-to-night bbp changes. For 
example, if the phytoplankton diel cycle corresponds (as in b) to a 15% day-to-night 
decrease in bbp at the two CALIOP measurement times (horizontal white 
dashed arrow) and this decrease is countered by a 5% increase in bbp from DVM 
animals (vertical white dashed arrow), then Δbbp will have a value of -10% 
(colour inside labelled white circle). Diagonal black lines correspond to the 
colour bar range for the CALIOP observations shown in Fig. 2. Note that values 
on the y axis encompass the range of values expected in the PSO (Extended 
Data Fig. 5 and Supplementary Discussion). 

Fig. 2 | Global climatological signal of vertically migrating animals 
quantified as the normalized difference ratio. Data are 2° × 2° bin means for 
the 2008-2017 CALIOP record. Yellow-and-black line, contour of annual mean 
sea surface temperature of 15 °C, which effectively separates high-latitude 
seasonal seas from lower-latitude permanently stratified oceans. Thick 
black lines separate major regions of the PSO (Fig. 3), in which the 5 subtropical 
gyres are defined by annual mean surface chlorophyll concentrations of 
<0.08 mg m⁻³. Dark grey, excluded pixels in which the water column depth is 
less than 1,000 m. 

The laser footprint of CALIOP has a diameter of 100 m at the ocean 
surface and a vertical sampling depth of 22 m in water. What this means 
is that each CALIOP measurement integrates the bbp signal from a water 
volume of 1.73 × 10⁵ m³, which is approximately 5 x 10° greater than the 
entire water volume measured over a given night in the field data shown 
in Fig. 1a. The large sampling volume of CALIOP thus ensures that each 
retrieved bbp value encompasses both the animals and suspended cells 
and particles in the surface layer. CALIOP is a polar orbiting sensor 
that conducts daytime and night-time (around 13:40 and 01:40 local 
time, respectively) near-nadir backscattering measurements along its 
orbit track at a sampling frequency that is equivalent to every 330 m 
on the ground. A slightly precessing orbit with a 16-day repeat cycle 
provides global bbp coverage, but ground tracks oriented in opposite 
directions on the light and dark sides of Earth mean that day and night 
bbp samples are rarely spatially coincident within a given 24 h period 
(Extended Data Fig. 1 and Supplementary Discussion). The global signature 
of DVM animals was therefore investigated by creating 2° × 2° 
(latitude by longitude) binned monthly CALIOP daytime (bbp^day) and 
night-time (bbp^night) values of bbp. 
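
The sampling volume given for a single CALIOP measurement can be checked by treating the laser footprint as a cylinder with the stated diameter and depth. The Python sketch below does only that arithmetic; the function name and the cylinder idealization are illustrative assumptions and are not part of the CALIOP processing chain.

```python
import math


def footprint_volume_m3(diameter_m: float, depth_m: float) -> float:
    """Volume of a cylinder with the given footprint diameter and depth."""
    radius_m = diameter_m / 2.0
    return math.pi * radius_m ** 2 * depth_m


# Values stated in the text: 100 m footprint diameter, 22 m sampling depth.
volume = footprint_volume_m3(100.0, 22.0)
print(f"{volume:.3g} m^3")  # 1.73e+05 m^3
```

The result agrees with the volume quoted for each measurement, which suggests that the quoted figure is a simple geometric estimate.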

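In the absence of any DVM, the biomass-normalized bbp difference 
ratio (equation (1)) is expected to yield a negative value for the 
measurement times of CALIOP (Fig. 1b): 

$$\Delta b_{\mathrm{bp}} = \frac{b_{\mathrm{bp}}^{\mathrm{night}} - b_{\mathrm{bp}}^{\mathrm{day}}}{b_{\mathrm{bp}}^{\mathrm{day}}} \qquad (1)$$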
Fig. 3 | Comparison of CALIOP normalized difference ratios and field-based 
DVM measurements in the PSO. a, Colour-coded regions of the PSO from 
Fig. 2. Yellow star, location of the HOT site. Orange star, location of the BATS 
site. b, CALIOP (Δbbp) versus field ADCP (ΔB_ADCP) normalized difference ratios. 
Red line, two-sided least-squares linear regression fit to all 5° × 5° bins within 
our 8 PSO regions (slope = 0.42; F-test, P = 0.001, n = 331 independent 
geographical bins) (Extended Data Figs. 7, 8 and Supplementary Discussion). 
Symbols, regional mean values of Δbbp and ΔB_ADCP, with s.e.m. shown for each 
region (SISG, n = 19; SASG, n = 18; TP, n = 115; NASG, n = 16; NTA, n = 23; NPSG, 
n = 59; STA, n = 22; SPSG, n = 59; other, n = 302). For completeness, the mean value 
for all PSO bins outside our eight primary regions is indicated by the white 
symbol. Numbers next to each symbol indicate the median number of days with 
ADCP data within the 32-year field record for the bins within a given region. The 
dashed line indicates the two-sided least-squares linear regression fit to 
regionally averaged data for the n = 6 primary regions in which ΔB_ADCP is 
significantly greater than zero (TP, NASG, NTA, NPSG, STA and SPSG) 
(slope = 0.64, r² = 0.55, P = 0.09). c, Mean monthly field-measured (1994-2005) 
migratory zooplankton biomass (g dry weight m⁻²) for the HOT site (Methods) 
(n = 112 net haul samples) and CALIOP-measured Δbbp (2008-2017) (n = 75 
monthly retrievals) for a 2° × 2° bin centred on the HOT site. Vertical lines 
indicate 1 s.d. Field and CALIOP data are offset on the x axis by 6 days to 
separate s.d. lines. d, As in c, for field (1994-2017) (n = 285 net haul samples) and 
CALIOP (n = 87 monthly retrievals) data for the BATS site. 


This is because daytime increases in phytoplankton cell size and 
organic carbon content and night-time cell division and metabolism 
create a baseline diel cycle in biomass-normalized bbp with an 
afternoon maximum and night-time minimum (Fig. 1b and Extended 
Data Fig. 9). The presence of DVM animals increases bbp^night (Fig. 1a) 
and therefore causes Δbbp to become less negative than the 
phytoplankton-only signal or even positive if the migrating animals are 
sufficiently abundant. Thus, phytoplankton and DVM animals 
together determine Δbbp (Fig. 1c), in which the magnitude and sign 
of Δbbp are an index of the DVM signal strength relative to that of the 
phytoplankton. 
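
The interplay between the phytoplankton diel cycle and the DVM signal in equation (1) can be illustrated with a small Python sketch. It uses the example percentages from the Fig. 1c legend (a 15% day-to-night decrease from phytoplankton and a 5% increase from DVM animals) and assumes, for illustration only, that both percentages are expressed relative to the daytime value. The function and variable names are hypothetical.

```python
def normalized_difference_ratio(bbp_night: float, bbp_day: float) -> float:
    """Equation (1): (night - day) / day."""
    if bbp_day <= 0:
        raise ValueError("bbp_day must be positive")
    return (bbp_night - bbp_day) / bbp_day


# Arbitrary units; only the ratios matter here.
bbp_day = 1.00
phyto_only_night = bbp_day * (1 - 0.15)      # 15% day-to-night decrease
dvm_contribution = 0.05 * bbp_day            # 5% increase from DVM animals
with_dvm_night = phyto_only_night + dvm_contribution

print(round(normalized_difference_ratio(phyto_only_night, bbp_day), 2))  # -0.15
print(round(normalized_difference_ratio(with_dvm_night, bbp_day), 2))    # -0.1
```

Under these assumptions the ratio moves from -15% to -10% when the migrators are added, matching the worked example in the figure legend.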

The 2008-2017 CALIOP record reveals eight broad regions at tropical 
and subtropical latitudes (equatorward of the yellow-and-black 
lines in Fig. 2) where climatological mean Δbbp values are spatially 
coherent. In these regions, plankton populations are relatively stable 
over time as the water column is permanently stratified within the 
euphotic zone (referred to hereafter as the permanently stratified 
ocean (PSO)). These eight broad features (Fig. 2) correspond to the 
seasonally varying boundaries of the North Pacific (NPSG), South 
Pacific (SPSG), North Atlantic (NASG), South Atlantic (SASG) and South 
Indian (SISG) subtropical gyres, and the higher-nutrient tropical Pacific 
(TP), north tropical (NTA) and south tropical (STA) Atlantic (Figs. 2, 
3a and Extended Data Fig. 2). Within these regions, Δbbp values are 
persistently positive in the NPSG, persistently negative in the TP and 
NTA, and strongly seasonal in the NASG, STA and all three Pacific areas 
(Extended Data Fig. 3). In the high-latitude seasonal regions (poleward 
of the yellow-and-black line in Fig. 2), the CALIOP record reveals large 
climatological mean values for Δbbp. However, persistent cloud cover 
and small-scale spatiotemporal variations in plankton populations in 
these regions cause within-bin mismatches in bbp^day and bbp^night data 
that yield significant bin-to-bin variability in Δbbp (Supplementary 
Discussion). From this point forward, therefore, our analysis primarily 
focuses on the eight regions of the PSO. 

If variability in Δbbp is mostly a reflection of the strength of the DVM 
rather than changes in the phytoplankton cycle, then results shown 
in Fig. 2 suggest that there are significant regional differences in the 
relative night-time abundance of these animals. To test this DVM basis 
of the regional patterns, we compared CALIOP data to historical field 
acoustic Doppler current profiler (ADCP) observations of paired day 
(I_dB^day) and night (I_dB^night) acoustic backscatter. Owing to the sparsity 
of such data, it was necessary to aggregate 32 years (1985-2017) of 
field measurements into monthly 5° × 5° bins to achieve reasonable 
global coverage (Methods). We then calculated median values of the 
normalized difference ratio: 

$$\Delta B_{\mathrm{ADCP}} = \frac{I_{\mathrm{dB}}^{\mathrm{night}} - I_{\mathrm{dB}}^{\mathrm{day}}}{I_{\mathrm{dB}}^{\mathrm{day}}} \qquad (2)$$


CALIOP bbp data were then reaggregated into equivalent 5° × 5° bins 
and regional median values of Δbbp were calculated using only those 
calendar months and bins for which ADCP data were available. Comparison 
of ΔB_ADCP and Δbbp for all bins within our eight PSO regions yielded 
a statistically significant relationship (P = 0.001, n = 331) (Fig. 3b) with 
a slope that is very similar to that calculated from regionally averaged 
values for the six regions in which ADCP data identify a significant 
DVM signal (Fig. 3b and Supplementary Discussion). In other words, 
the ADCP and lidar datasets both indicate that DVM animals constitute 
a greater fraction of night-time plankton communities in the optically 
clear subtropical gyres that are most advantageous to visual predators. 
Notably, the regionally averaged data for the SISG and SASG deviate 
from the other PSO regions (Fig. 3b). The reason for this difference is 
unknown. These are the only two regions in which ΔB_ADCP is near zero, 
suggesting either that there are few DVM animals or that the influx of 
night-time DVM animals is compensated by reverse-DVM animals that 
leave the surface layer. By contrast, CALIOP data indicate that there 
is a moderate DVM signal in the SISG and SASG (Fig. 3b). Perhaps the 
discrepancy in these two regions is simply due to poor ADCP coverage 
(typically one day-night observation per 5° × 5° bin for the 32-year field 
record). However, all of the ADCP data in the SASG and all but eight 
observations in the SISG predate the CALIOP mission, so a temporal 
change in DVM populations cannot be ruled out. 
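
The aggregation step described above, in which paired day and night observations are grouped into monthly 5° × 5° bins and summarized by the median normalized difference ratio, can be sketched as follows. This is a minimal illustration and not the processing code used for the ADCP archive: the record layout, the function names and the choice to skip records with a zero daytime value are all assumptions.

```python
import math
from collections import defaultdict
from statistics import median


def bin_key(month, lat, lon, size_deg=5.0):
    """Return (month, row, col) for the size_deg x size_deg bin holding a point."""
    row = math.floor((lat + 90.0) / size_deg)
    col = math.floor((lon + 180.0) / size_deg)
    return (month, row, col)


def binned_median_ratio(records, size_deg=5.0):
    """records: iterable of (month, lat, lon, day_value, night_value).

    Returns {bin_key: median of (night - day) / day} over each bin.
    """
    ratios = defaultdict(list)
    for month, lat, lon, day, night in records:
        if day != 0:  # assumption: skip records that cannot be normalized
            ratios[bin_key(month, lat, lon, size_deg)].append((night - day) / day)
    return {key: median(values) for key, values in ratios.items()}


# Made-up observations near the HOT and BATS sites, for illustration only.
observations = [
    (1, 22.75, -158.0, 10.0, 12.0),
    (1, 21.00, -159.0, 10.0, 11.0),
    (1, 31.67, -64.17, 20.0, 19.0),
]
result = binned_median_ratio(observations)
```

The same binning function could be applied to the satellite values, so that the two datasets are compared only for bins and calendar months that both contain data.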
Fig. 4 | CALIOP-based estimates of vertically migrating animal biomass 
(DVM_CALIOP) and temporal changes. a, Annual mean DVM_CALIOP (g dry weight 
m⁻²). Yellow-and-black and thick black lines as in Fig. 2. b, Relationship between 
annual mean regional DVM_CALIOP and mixed-layer net primary production data 
from the carbon-based production model (NPP_CbPM; mg C m⁻² day⁻¹) (Methods). 
Solid line, two-sided least-squares linear regression fit (r² = 0.80; F-test, 
P = 0.002, n = 9 geographical regions) of the PSO regions, which are colour-coded 
according to Fig. 3a. Black symbols, high-latitude North Pacific (NP), 
North Atlantic (NA) and Southern Ocean (SO) (see c). Horizontal and vertical 
lines indicate 1 s.d. for annual values between 2008 and 2017 (n = 111 months per 
geographical region). c, Rate of change in DVM_CALIOP for 6° × 6° bins for eight 
PSO regions and at the HOT and BATS sites (coloured stars). Only bins that stay 
within their respective regional boundaries throughout the year are shown 
(Extended Data Fig. 2). A black dot in centre of bin indicates that the trend is 
significant (two-sided least-squares linear regression; F-test, P < 0.05, n = 111 
months per bin). Outlined bins exhibit a significant (two-sided least-squares 
linear regression; F-test, P < 0.05, n = 111 months per bin) relationship between 
DVM_CALIOP and NPP_CbPM, with black and green outlines indicating positive and 
negative correlations, respectively. 


Day and night zooplankton net tows provide quantitative field 
measurements of DVM biomass, but these labour-intensive measurements 
are rarely collected at regular intervals over long periods. Two 
exceptions in the PSO are the sustained records at the Hawaii ocean 
time-series (HOT) site (22° 45’ N, 158° W) and the Bermuda Atlantic 
time-series (BATS) site (31° 40’ N, 64° 10’ W) (Fig. 3a). Despite considerable 
interannual variability, significant (P < 0.01) seasonal cycles in 
DVM biomass (g dry weight m⁻²) are observed at these locations. For the 
CALIOP bin centred on the HOT site, the seasonal cycle in Δbbp from 
January to November is significantly correlated (r² = 0.44, P = 0.03) 
with field observations (Fig. 3c). Similarly, Δbbp for the CALIOP bin 
centred on the BATS site exhibits a two-peaked seasonal cycle that 
correlates well with field observations from August to June (r² = 0.42, 
P = 0.03) (Fig. 3d). These results are of particular note given the temporal 
and spatial contrast between datasets (CALIOP data are from 
2° × 2° bins collected from 2008 to 2017; zooplankton net data were 
collected at HOT and BATS from 1994 to 2005 and from 1994 to 2017, 
respectively). Thus, the correspondence between Δbbp (which includes 
DVM animals, surface-resident organisms and suspended particles) 
and animal-specific ADCP (Fig. 3b) and net measurements (Fig. 3c, 
d) gives confidence to the DVM signal detected by CALIOP. 

The biomass-normalized Δbbp property reflects the relative strength 
of the DVM signal, whereas DVM biomass (DVM_CALIOP) is more quantitatively 
related to the simple difference, bbp^night - bbp^day. We estimated 
DVM_CALIOP from this difference as: 

$$\mathrm{DVM}_{\mathrm{CALIOP}} = a\,(b_{\mathrm{bp}}^{\mathrm{night}} - c\,b_{\mathrm{bp}}^{\mathrm{day}}) \qquad (3)$$

in which a is an empirical conversion factor between backscattering 
(m⁻¹) and biomass (g m⁻²) determined from BATS and CALIOP data and 
c accounts for night-day differences in bbp from growth-rate- and 
day-length-dependent variations in the diel cycle of phytoplankton 
(Methods). Application of equation (3) to the CALIOP record yields 


a global distribution of DVM_CALIOP (Fig. 4a) that differs markedly from 
Δbbp (Fig. 2). In particular, DVM_CALIOP is generally low in the subtropical 
gyres and high in the nutrient-enriched tropical regions. Therefore, 
although the relative contribution of DVM animals to night-time plankton 
communities is higher in clearer waters (Fig. 2), total DVM biomass is 
higher in more productive regions in which there is a greater availability 
of food (Fig. 4a). Indeed, the annual mean DVM_CALIOP for our PSO regions 
is highly correlated with passive ocean-colour-based estimates of net 
primary production (r² = 0.80, P = 0.002) (Fig. 4b, Methods). Results 
for the high-latitude northern and southern regions are also consistent 
with this finding (Fig. 4b). Notably, an increase in phytoplankton 
production yields a less than proportional increase in DVM_CALIOP, as 
might be expected for migrating animals that may be multiple trophic 
levels removed from the phytoplankton. Consequently, the night-day 
difference in bbp caused by the diel cycle of phytoplankton (Fig. 1b) 
generally increases more rapidly than the DVM signal as productivity 
increases, resulting in negative Δbbp values in productive tropical 
regions (Fig. 2). 
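
The form of equation (3) can be made concrete with a short Python sketch. The numerical values of a and c below are hypothetical placeholders chosen only to show the arithmetic; the actual factors are derived in the Methods and are not reproduced here. The function name is likewise an assumption.

```python
def dvm_caliop(bbp_night: float, bbp_day: float, a: float, c: float) -> float:
    """Equation (3): a * (bbp_night - c * bbp_day).

    a converts backscattering (m^-1) to biomass (g m^-2);
    c adjusts the daytime value for the phytoplankton diel cycle.
    """
    return a * (bbp_night - c * bbp_day)


# Hypothetical inputs, for illustration only.
A_PLACEHOLDER = 1000.0   # g m^-2 per m^-1, not the value used in the study
C_PLACEHOLDER = 0.9      # dimensionless, not the value used in the study

biomass = dvm_caliop(0.0020, 0.0021, a=A_PLACEHOLDER, c=C_PLACEHOLDER)
uncorrected = dvm_caliop(0.0020, 0.0021, a=A_PLACEHOLDER, c=1.0)
```

With these placeholder numbers the corrected estimate is positive, whereas setting c to 1 gives a negative value. This shows why a night-time bbp that is lower than the daytime bbp can still be consistent with a non-zero DVM biomass once the phytoplankton diel cycle is taken into account.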

Evidence of long-term changes in zooplankton populations, often 
linked to climate oscillations, has emerged from field time-series studies. 
At the HOT site, field-measured DVM zooplankton biomass 
increased on average by 12.4 mg m⁻² year⁻¹ (38% per decade) between 
1994 and 2005 (P = 0.04). The DVM_CALIOP record indicates that this 
trend continued (P = 0.05) at a rate of 23% per decade from 2008 to 
2017 (Fig. 4c). At BATS, an overall increasing trend of 7.4 mg m⁻² year⁻¹ 
(54% per decade) was reported for field observations from 1994 to 
2011 (P < 0.01). Expanding this dataset to 2017 and re-evaluating the 
data, we find that DVM biomass increased 63% per decade from 1994 
to 2007 (P = 0.01) and then decreased 28% per decade from 2008 to 
2017 (P = 0.04). For this latter period, DVM_CALIOP data similarly suggest 
a decreasing trend (9% per decade; P > 0.1) for the 2° × 2° bin that 
encompasses the BATS site (Fig. 4c). The global coverage provided by 
CALIOP now allows the evaluation of DVM biomass temporal trends 
to be greatly expanded. For example, subdividing our 8 PSO regions 
(Fig. 3a) into 6° × 6° bins and evaluating only those bins that remain 
within their respective regional boundaries throughout the year 
(Extended Data Fig. 2) reveals that decadal trends in DVM_CALIOP have 
coherent geographical patterns (Fig. 4c). Specifically, a predominance 
of increasing DVM animal biomass is observed in the NPSG, SPSG, SASG 
and SISG, whereas decreasing DVM biomass is indicated across much 
of the tropical regions and the NASG. Moreover, DVM_CALIOP is positively 
correlated with changes in phytoplankton production for most bins, 
but in the two tropical Atlantic regions the correlations are 
counterintuitively inverse (Fig. 4c). 
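
Trends such as those quoted above combine a least-squares slope with a conversion to a percentage change per decade. The Python sketch below shows one way to do this. It normalizes by the mean of the record, which is our assumption; the text does not state which reference value was used. The data are invented for illustration and the function names are hypothetical.

```python
def least_squares_slope(x, y):
    """Ordinary least-squares slope of y against x."""
    n = len(x)
    if n < 2 or n != len(y):
        raise ValueError("need two or more paired values")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    return sxy / sxx


def percent_per_decade(slope_per_year, reference):
    """Express a per-year slope as a percentage of `reference` per decade."""
    return 100.0 * 10.0 * slope_per_year / reference


# Invented annual biomass values (mg m^-2), for illustration only.
years = [2008, 2009, 2010, 2011, 2012]
biomass_series = [100.0, 102.0, 104.0, 106.0, 108.0]

slope = least_squares_slope(years, biomass_series)   # mg m^-2 per year
trend = percent_per_decade(slope, sum(biomass_series) / len(biomass_series))
print(f"{slope:.1f} mg m^-2 per year, {trend:.1f}% per decade")
```

A significance test on the slope, such as the F-test mentioned in the figure legends, would be a separate step and is not shown here.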

The power of a satellite lidar when studying marine animals lies in 
its unparalleled annual coverage of the global ocean. By contrast, in 
this study, it was necessary to compile more than 30 years of ADCP 
measurements to create a single field-test dataset, and even then spatial 
and temporal coverage remained poor in many regions (Extended 
Data Fig. 4). A challenge with satellite lidar data, however, is that the 
measured night-day bbp differences are not solely owing to DVM 
animals, so continued work is needed to refine descriptions of the 
non-DVM contributors (Methods and Supplementary Discussion). 
Further analyses of field data (for example, ADCP data or continuous 
plankton recorder survey observations) and modelling are needed 
to fully understand the causative ecological processes that underlie 
the spatial and temporal DVM patterns that are observed from space, 
and an advanced satellite lidar with ocean-profiling capabilities may 
contribute new insights into DVM behaviours (Supplementary Discussion). 
Although there are multiple new avenues to pursue regarding the 
study of DVM animals, the results presented here provide a step forward 
in the global exploration of this greatest animal migration on Earth. 
Methods 


Field data 

Data shown in Fig. 1a provide five examples from the subarctic Pacific in 
which DVM animals significantly increased ship-measured bbp at night. 
Data are from a previous study, which also provides a full description 
of the study region, measurement system, data processing and uncertainty 
analyses. The phytoplankton diel cycle in bbp shown in Fig. 1b 
was calculated from previously published data and is normalized to 
1 at the night-time minimum (Supplementary Discussion). 


CALIOP data 

CALIOP is the primary instrument on the cloud-aerosol lidar and infra- 
red pathfinder satellite observation (CALIPSO) platform”, which was 
launched in 2006. CALIOP measures the total time-dependent return 
of a pulsed laser output at 1,064 nm and both the co-polarized and 
cross-polarized return of a 532-nm laser output. The current study 
is based on night and day differences in the cross-polarized return at 
532 nm. The bbp values (available at http://www.science.oregonstate. 
edu/ocean.productivity/) were calculated from this signal following 
a previously published study”, except that bbp at 532 nm was not 
converted to bbp at 440 nm as in the previous study. In the previous 
study", 30° off-nadir CALIOP tilting manoeuvres were conducted every 
2 weeks between 2015 and 2016 to collect ocean measurements with 
minimal surface backscatter froma wide range of ocean environments. 
Removing molecular backscatter from the CALIOP parallel channel and 
combining with data from the cross-polarized channel, we then derived 
particulate backscatter depolarization ratios and compared these 
data to collocated MODIS diffuse attenuation (K,) values. This analysis 
yielded alinear relationship between CALIOP depolarization ratios and 
the K, that was used in the previous study’ to directly retrieve bbp val- 
ues from CALIOP without reliance on collocated MODIS observations. 
This same approach was used for the current study. However, we also 
further evaluated the relationship between particulate depolarization 
ratios and $K_d$ at 532 nm using an extensive open-ocean airborne lidar
dataset compiled from the 2012 Azores campaign, the Ship-Aircraft
Bio-Optical Research (SABOR) campaign and three campaigns of the
North Atlantic Aerosol and Marine Ecosystem Study (NAAMES) (no
airborne data were collected during the fourth NAAMES campaign as
mechanical issues grounded the plane). This airborne-based analysis
confirmed the previous CALIOP-MODIS finding of a linear relationship
between depolarization ratios and $K_d$, with a mean depolarization:$K_d$
ratio of 1.76 m and s.d. of 0.19 m. Finally, we calculated bbp using a
volume scattering function (VSF) at 180° following a previously
published study that was developed using measurements from collocated
CALIPSO and MODIS 531-nm data. This VSF is consistent with scattering 
properties for particles in the phytoplankton size domain and it was 
applied to both day and night CALIOP data. Organisms larger than 
phytoplankton, such as DVM animals, will have a VSF with a lower
efficiency in the backward direction and enhanced efficiency in the
forward direction. This difference is one factor (Supplementary
Discussion) that influences the relationship between the
CALIOP-retrieved DVM backscatter signal and the biomass of these
animals, but it is encompassed in the field-based scaling factor between
CALIOP $\mathrm{bbp}^{\mathrm{night}} - \mathrm{bbp}^{\mathrm{day}}$ data
and zooplankton dry weight (see below).

CALIOP merges low-gain and high-gain data onboard to reduce data 
downlink. The onboard gain ratio calculations fail when there are not 
enough data for both the low- and high-gain channels, which can hap- 
pen for night-time cross-polarization measurements. At the beginning 
of the CALIPSO mission, there was an error in the default values of the 
gain ratios when the onboard calculation fails. This issue rendered the 
night-time cross-polarization measurements unusable for ocean bbp 
retrievals until the error was corrected in late 2007, which is why the 
current study on night-day differences in bbp was limited to the period 
of 2008-2017. In addition, advanced microwave scanning radiometer
data were used to flag and omit CALIOP retrievals made at wind speeds
of >9 m s⁻¹ to avoid bubble contamination of the bbp values. At wind
speeds of <9 m s⁻¹, we also applied a depolarization ratio threshold to
remove bubble-contaminated data based on statistical analysis from
high-wind conditions.
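
The two-stage bubble screening described above can be sketched as a boolean mask. This is a minimal illustration rather than the processing code used for the study: the function and array names are hypothetical, the depolarization threshold is left as a caller-supplied parameter because its value is not given here, and the choice to reject values above the threshold is an assumption. Only the 9 m s⁻¹ wind-speed cut-off comes from the text.

```python
import numpy as np

WIND_LIMIT = 9.0  # m s^-1, wind-speed cut-off stated in the text


def bubble_free_mask(wind_speed, depol_ratio, depol_threshold):
    """Return True where a retrieval passes both bubble screens.

    depol_threshold is a hypothetical parameter: the text only says that a
    threshold was derived statistically from high-wind conditions.
    """
    wind_speed = np.asarray(wind_speed, dtype=float)
    depol_ratio = np.asarray(depol_ratio, dtype=float)
    # Retrievals without wind data cannot be screened, so they are rejected.
    has_wind = np.isfinite(wind_speed)
    low_wind = has_wind & (wind_speed <= WIND_LIMIT)
    # Assumption: values above the threshold are treated as contaminated.
    # The source does not say which side of the threshold is rejected.
    clean_depol = depol_ratio <= depol_threshold
    return low_wind & clean_depol


# Illustrative values only.
mask = bubble_free_mask([5.0, 12.0, np.nan, 3.0], [0.1, 0.1, 0.1, 0.9], 0.5)
```

Rejecting retrievals that lack wind data is consistent with the strip of low retrieval success attributed to a gap in ancillary wind data in Extended Data Fig. 1.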


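Calculation of $\mathrm{DVM}_{\mathrm{CALIOP}}$
The biomass of DVM animals ($\mathrm{DVM}_{\mathrm{CALIOP}}$) was estimated
from CALIOP-measured night-day differences in bbp, an estimate of the
day-to-night change in bbp due to the phytoplankton diel cycle alone
(Fig. 1b), and a scaling factor between backscattering and zooplankton
dry weight. The value of $\mathrm{bbp}^{\mathrm{night}}$ can be expanded
into backscatter by DVM animals ($\mathrm{bbp}^{\mathrm{DVM}}$) and
backscatter by phytoplankton and other non-migrating particles
($\mathrm{bbp}^{\mathrm{other}}$):


$$\mathrm{bbp}^{\mathrm{night}} = \mathrm{bbp}^{\mathrm{DVM}} + \mathrm{bbp}^{\mathrm{other}} \qquad (4)$$


The value of $\mathrm{bbp}^{\mathrm{other}}$ is equal to
$\mathrm{bbp}^{\mathrm{day}}$ corrected for the phytoplankton-based
change ($c$) in bbp from day to night. Rearranging equation (4) and
solving for $\mathrm{bbp}^{\mathrm{DVM}}$ yields:


$$\mathrm{bbp}^{\mathrm{DVM}} = \mathrm{bbp}^{\mathrm{night}} - c\,\mathrm{bbp}^{\mathrm{day}} \qquad (5)$$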


A variety of factors may influence the value of c, including the daily 
division rate of the phytoplankton population, the degree to which this 
division is synchronized around the day-night cycle, the composition 
of the phytoplankton community and day length. Over much of the
PSO, phytoplankton populations are dominated by Prochlorococcus,
Synechococcus and picoeukaryotic species, all of which generally
synchronize cell division to the first half of the night. We therefore
focused on describing $c$ as a function of both division rate ($\mu$;
divisions per day) and day length.

First, the dependence of $c$ on $\mu$ was determined from a previously
published laboratory study and is described by (Extended Data Fig. 5a
and Supplementary Discussion):


$$c_1 = 0.929 - 0.122\,\mu \qquad (6)$$


in which the term −0.122 has units of days per division. Equation (6) was
applied to MODIS passive ocean-colour-based estimates of $\mu$ calculated
for the 2008-2017 period (see below). Second, the dependence of $c$ on
day length exists because a change in day length causes the two CALIOP 
sampling points (determined by its fixed orbit) to line up differently 
with the phytoplankton diel cycle. For the range of day lengths encoun- 
tered across the PSO, this day length (dl) dependence is described by 
(Extended Data Fig. 5b and Supplementary Discussion): 


$$c_2 = 0.986 + 0.0012\,\mathrm{dl} \qquad (7)$$


in which dl (h) is known precisely for each CALIOP pixel based on date
and latitude and the term 0.0012 has units of h⁻¹.
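
As an illustration of how day length follows from date and latitude, the sketch below uses a common textbook approximation for solar declination and the sunrise hour angle, then applies the day-length relationship of equation (7). The declination formula and the function names are assumptions made for this example; the text does not say which astronomical algorithm was used.

```python
import math


def day_length_hours(day_of_year, latitude_deg):
    """Approximate day length (h) from day of year and latitude."""
    # Solar declination (degrees) from a common sinusoidal approximation.
    declination = 23.44 * math.sin(2.0 * math.pi * (284 + day_of_year) / 365.0)
    phi = math.radians(latitude_deg)
    delta = math.radians(declination)
    # Cosine of the sunrise hour angle, clamped for polar day and polar night.
    cos_omega = max(-1.0, min(1.0, -math.tan(phi) * math.tan(delta)))
    return 24.0 / math.pi * math.acos(cos_omega)


def day_length_correction(dl_hours):
    """Day-length term of equation (7)."""
    return 0.986 + 0.0012 * dl_hours


# Example: a mid-latitude pixel near the northern summer solstice.
dl = day_length_hours(172, 40.0)
correction = day_length_correction(dl)
```

Over the range of day lengths in the PSO the correction stays close to 1, which is why it acts as a small adjustment to the division-rate term.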

Monthly DVM zooplankton dry-weight data are available at the BATS 
site over the 2008-2017 CALIOP period. Comparison of these field 
data to monthly $\mathrm{bbp}^{\mathrm{DVM}}$ values calculated from equations (5)-(7) for 
the 2° x 2° CALIOP bin centred on the BATS site yields a mean scaling 
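factor between $\mathrm{bbp}^{\mathrm{DVM}}$ and DVM biomass of 1,596 g m⁻¹
(Supplementary Discussion). Applying this conversion factor gives the
following expression for $\mathrm{DVM}_{\mathrm{CALIOP}}$:


$$\mathrm{DVM}_{\mathrm{CALIOP}} = 1{,}596\,(\mathrm{bbp}^{\mathrm{night}} - c_1 c_2\,\mathrm{bbp}^{\mathrm{day}}) \qquad (8)$$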
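
The chain from equations (5) to (8) can be written out as a short calculation. The sketch below is illustrative only: the function names are hypothetical, the input values in the example are invented, and it assumes that the division-rate and day-length corrections of equations (6) and (7) enter as a product, as equation (8) indicates.

```python
import numpy as np

SCALING_FACTOR = 1596.0  # BATS-derived factor between bbp_DVM and dry weight


def phyto_correction(mu, day_length_h):
    """Product of the corrections in equations (6) and (7)."""
    c_mu = 0.929 - 0.122 * np.asarray(mu, dtype=float)
    c_dl = 0.986 + 0.0012 * np.asarray(day_length_h, dtype=float)
    return c_mu * c_dl


def dvm_caliop(bbp_night, bbp_day, mu, day_length_h):
    """DVM biomass estimate following equation (8)."""
    c = phyto_correction(mu, day_length_h)
    bbp_night = np.asarray(bbp_night, dtype=float)
    bbp_day = np.asarray(bbp_day, dtype=float)
    # Equation (5): night-time backscatter not explained by phytoplankton.
    bbp_dvm = bbp_night - c * bbp_day
    return SCALING_FACTOR * bbp_dvm


# Invented inputs: equal day and night bbp, 0.5 divisions per day, 12 h days.
estimate = dvm_caliop(0.002, 0.002, 0.5, 12.0)
```

With equal day and night bbp the estimate is still positive, because the phytoplankton diel cycle alone would have lowered night-time bbp relative to the daytime value.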


In Fig. 3c, d, we compare HOT and BATS DVM zooplankton dry-weight
data to the CALIOP-retrieved property, Δbbp, simply to maintain
consistency with Fig. 3b (Supplementary Discussion). However, DVM
biomass should be quantitatively related to the simple difference,
$\mathrm{bbp}^{\mathrm{night}} - \mathrm{bbp}^{\mathrm{day}}$. The seasonal
cycle in Δbbp corresponds to that of DVM biomass at HOT and BATS
because $\mathrm{bbp}^{\mathrm{day}}$ varies little over the year at these
two sites, making Δbbp highly correlated (r > 0.99, P < 0.001) with the
simple difference, $\mathrm{bbp}^{\mathrm{night}} - \mathrm{bbp}^{\mathrm{day}}$.


Field ADCP data 

Global ADCP data were obtained from the Joint Archive for Shipboard 
ADCP (JASADCP, http://ilikai.soest.hawaii.edu/sadcp/). We selected 
acoustic backscatter data for the upper 20 m of the ocean surface 
(equivalent to the sampling depth of CALIOP) from ADCPs with fre- 
quencies of 150 or 300 kHz and excluded data from frequencies of 38 
and 75 kHz. The frequencies of 150 and 300 kHz better isolate scat- 
tering from the small (about 0.5-5 mm) animals that are most likely 
to be detected by CALIOP owing to their much greater abundances. 
Backscatter amplitude ($E$) measurements were converted into acoustic
intensity ($I_{\mathrm{dB}}$) following a previously published method:


$$I_{\mathrm{dB}} = 10\log_{10}\left(10^{k_c E/10} - 10^{k_c E_{\mathrm{noise}}/10}\right) \qquad (9)$$


in which $k_c$ is a scaling factor used to convert backscatter amplitude
counts to decibels (dB) and $E_{\mathrm{noise}}$ is the noise floor of the
individual ADCP dataset. Values of $k_c$ are frequency-dependent and were
taken from a previous study. The noise floor was defined as the minimum
$k_c E$ for each individual data file. A total of 7,622 individual paired diel
cycles were extracted from the global database (Extended Data Fig. 4). 
The geographical distribution of these diel cycle data is largely biased 
to the regions surrounding the Hawaiian Islands, along the Tropical 
Ocean atmosphere (TAO/TRITON) mooring array in the equatorial 
Pacific, the Southern California Bight, between Chile and Antarctica, 
and off the northeast United States seaboard (Extended Data Fig. 4). 
Detailed inspection of echograms computed from each individual data 
file revealed occasional time-synchronicity issues, with the apparent 
deep DVM occurring more than 2 h before or after local sunrise or
sunset. To validate the time stamp of each data file, we compared the
timing of the mesopelagic (maximum depth of the data file to 150 m) DVM
to the time of local sunrise and sunset. Time-synchronicity issues were
detected in less than 10% of the data files. The time stamps of these
errant ADCP backscattering observations were corrected to synchronize
the deep DVM with sunrise or sunset before computing the day-night
differences analysed in the manuscript.
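
The amplitude-to-intensity conversion can be sketched as follows, assuming equation (9) has the form I_dB = 10 log10(10^(k_c E/10) − 10^(k_c E_noise/10)), with E the echo amplitude in counts. The function name, the symbol names and the value of k_c in the example are illustrative assumptions; real values of k_c depend on the instrument frequency.

```python
import numpy as np


def acoustic_intensity_db(echo_counts, k_c):
    """Noise-corrected acoustic intensity (dB) for one ADCP data file."""
    scaled = k_c * np.asarray(echo_counts, dtype=float)
    # Noise floor: minimum scaled amplitude in the file, as stated in the text.
    noise = np.nanmin(scaled)
    signal = 10.0 ** (scaled / 10.0) - 10.0 ** (noise / 10.0)
    # Samples at the noise floor give zero signal; report NaN instead of -inf.
    out = np.full(scaled.shape, np.nan)
    positive = signal > 0
    out[positive] = 10.0 * np.log10(signal[positive])
    return out


# Illustrative counts and an illustrative k_c of 0.5 dB per count.
intensity = acoustic_intensity_db([40.0, 60.0, 80.0], 0.5)
```

Because the noise floor is taken from each file's own minimum, at least one sample per file carries no signal above noise and is masked here.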


Field time-series zooplankton biomass data 

Monthly climatological migratory zooplankton dry-weight data shown
in Fig. 3c for HOT were calculated using data from Fig. 1 of a previously
published study. Monthly climatological migratory zooplankton
dry-weight data shown in Fig. 3d for BATS were calculated from data
from 1994 to 2017 provided by D.K.S. The 1994-2011 subset of these
data has previously been published.
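
A monthly climatology of migrant biomass can be sketched as below. The example follows the statement in the reporting summary that DVM biomass is the difference between night and day net collections; the record layout and function name are hypothetical, and the numbers are invented.

```python
from collections import defaultdict
from statistics import mean


def monthly_climatology(samples):
    """Mean migrant biomass per calendar month.

    samples: iterable of (year, month, day_dry_weight, night_dry_weight).
    """
    by_month = defaultdict(list)
    for _year, month, day_dw, night_dw in samples:
        # Migrant biomass is taken as the night minus day difference.
        by_month[month].append(night_dw - day_dw)
    return {month: mean(values) for month, values in sorted(by_month.items())}


# Invented records for two Januaries and one February.
records = [
    (1994, 1, 1.0, 1.5),
    (1995, 1, 1.0, 2.0),
    (1994, 2, 2.0, 2.5),
]
climatology = monthly_climatology(records)
```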


Global phytoplankton NPP, biomass, and division rate data 

Phytoplankton NPP values used for Fig. 4b, c, phytoplankton division
rates ($\mu$) used for calculating $c_1$ in equation (6) and phytoplankton
biomass ($C_{\mathrm{phyto}}$) shown in Extended Data Fig. 6 are from the
CbPM using MODIS passive ocean-colour data collected between 2008 and
2017 (data are available at
http://www.science.oregonstate.edu/ocean.productivity/).


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The CALIOP lidar and field ADCP datasets analysed during the current
study are available at
http://www.science.oregonstate.edu/ocean.productivity/ and from the
Joint Archive for Shipboard ADCP at
http://ilikai.soest.hawaii.edu/sadcp/. Source Data for Figs. 3b-d, 4b and Extended
Data Figs. 3, 6-8 are provided with the paper.

Extended Data Fig. 1 | Global CALIOP observational coverage. a-e, CALIOP
ground tracks for 1 (a), 2 (b), 4 (c), 8 (d) and 16 (e) days. f, Number of months for
each 2° × 2° bin with day and night retrievals of bbp for the 2008-2017 study
period. The total number of months possible is 115. The north-south strip of
low retrieval success in the middle of the Pacific is caused by a gap in ancillary
AMSER surface wind data. AMSER wind data are used for flagging CALIOP data
with potential bubble contamination (Methods).

Extended Data Fig. 2 | Global distributions of monthly climatological mean values of Δbbp. Thin black line, contour of monthly mean sea surface temperature
of 15 °C. Thick black lines, monthly extent of the 5 subtropical gyres in which annual mean surface chlorophyll concentrations are <0.08 mg m⁻³.

Extended Data Fig. 3 | Time series of Δbbp for the PSO. a-h, The 2008-2017 monthly values of Δbbp (%) for the eight PSO regions described in Fig. 3a.

Extended Data Fig. 4 | Global coverage of field ADCP data. Number of days
within each 5° × 5° bin for which paired day-night ADCP data are available from
the 1985-2017 JASADCP-based field archive (http://ilikai.soest.hawaii.edu/sadcp/).
The total number of days possible is 11,680. White bins, no data.
Yellow/black line, contour of annual mean sea surface temperature of 15 °C.
Thick white lines, boundaries of the eight PSO regions described in Fig. 3a.

Extended Data Fig. 5 | Influence of phytoplankton division rate and of day
length on calculated DVM backscatter for the PSO. a, Values for $c_1$
(equation (6)) over the range of phytoplankton division rates ($\mu$) in the PSO
(n = 999 monthly $\mu$ values for all PSO regions). Solid circle, mean value of $\mu$ and
$c_1$ for the PSO. The box shows ±1 s.d. of the mean of $\mu$ and the solid line shows
values of $c_1$ over the full range in $\mu$ for the PSO. b, Values for $c_2$ (equation (7)) over
the range of day lengths in the PSO (n = 999 monthly day length values for all
PSO regions). Solid circle, mean day length and $c_2$ value for the PSO. The box
shows ±1 s.d. of the mean day length and the solid line shows values of $c_2$ over
the full range in day length for the PSO.

Extended Data Fig. 6 | Seasonal cycles in monthly mean regionally
integrated values of $\mathrm{DVM}_{\mathrm{CALIOP}}$ and phytoplankton biomass for high-latitude
regions. a, North Pacific. b, North Atlantic. c, Southern Ocean. These three
regions are described in Fig. 4c. Vertical lines show ±1 s.d. (n = 111 monthly
$\mathrm{DVM}_{\mathrm{CALIOP}}$ (g m⁻²) and $C_{\mathrm{phyto}}$ (mg C m™) values for each region). $C_{\mathrm{phyto}}$ data are
from the carbon-based production model (CbPM) and MODIS passive
ocean-colour data (Methods).

Extended Data Fig. 7 | Comparison of CALIOP night-day bbp differences and
field ADCP night-day differences in acoustic backscatter. Dashed line,
two-sided least-squares linear regression fit to data for the SPSG, NPSG, TP, STA,
NASG and NTA (n = 6). For completeness, the mean value for PSO bins outside
our eight primary regions is indicated by the white symbol. Symbols, regional
mean ± s.e.m. (SISG, n = 19; SASG, n = 18; TP, n = 115; NASG, n = 16; NTA, n = 23;
NPSG, n = 59; STA, n = 22; SPSG, n = 59; other, n = 302). Symbol colours identify
region (labelled on the right) and correspond to the colours shown in Fig. 3a.
Numbers next to each symbol indicate the median number of days with ADCP
data within the 32-year field record for the 5° × 5° bins.

Extended Data Fig. 8 | Bin-to-bin comparison of CALIOP Δbbp and
field-based DVM measurements in the PSO. a, CALIOP normalized difference ratios
(Δbbp) versus field ADCP (ΔB_ADCP) normalized difference ratios for 5° × 5° bins
within the PSO. Black line, two-sided least-squares linear regression fit (F-test
P value for slope; P < 0.001; n = 331 independent geographical bins) for all data
from our eight primary PSO regions (coloured symbols are labelled on the
right). White symbols, PSO values for 5° × 5° bins outside of the 8 primary
regions. Inclusion of these data in the linear regression analysis increases
the F-test value to P = 0.005 (n = 633 independent geographical bins).
b, Relationship between field DVM biomass at the HOT site measured for a
given calendar month and year (x axis) versus DVM biomass measured during
all other years for the same calendar month (y axis).

Global CALIOP satellite lidar data were collected continuously over a 10 year period at 330 m sampling distance between retrievals,
which are only made along the satellite orbit track (a detailed description of CALIOP data coverage is provided in the Supplementary
Discussion for the manuscript). These data were compared to historical ADCP field measurements of acoustic backscatter, with a
total of 7,622 day-night ADCP measurement pairs. CALIOP data were also compared to field zooplankton measurements at the long-
term HOT and BATS field sampling sites, with a total of 397 zooplankton collections for the two sites.


Global CALIOP satellite data shown in figure 2 of the main manuscript were binned into 2 degree latitude by 2 degree longitude bins 
to provide high signal-to-noise retrievals at high spatial resolution. For the CALIOP-ADCP comparison, CALIOP data were binned into 
5 degree latitude by 5 degree longitude bins to be comparable with the much coarser resolution of field ADCP data. For figure 4c of 
the manuscript, CALIOP data were binned into 6 degree latitude by 6 degree longitude bins to increase signal-to-noise for detecting 
temporal trends in the CALIOP record. CALIOP data analyzed during the current study are available at http:// 
www.science.oregonstate.edu/ocean.productivity/ and are reflective of optical properties in the upper 22 m of the ocean surface. 
For ADCP data, all available field measurements from the Joint Archive for Shipboard ADCP (JASADCP, http://ilikai.soest.hawaii.edu/ 
sadcp/) were obtained and then only those measurements conducted at frequencies of 150 or 300 kHz were used for comparison 
with CALIOP retrievals because these frequencies better isolate scattering from the small (~0.5-5 mm) animals that are most likely
detected by CALIOP. ADCP measurements provide depth-resolved information on acoustic backscatter. HOT and BATS field data
were taken from the publications from Hannides, C.C.S, et al. (2009) Deep-Sea Res. I 56, 73-88 and Steinberg, D.K., et al. (2012)
Global Biogeochem. Cycl. 26, GB1004, doi:10.1029/2010GB004026, respectively. Sampling frequency was dictated by the protocols 
of the respective programs and DVM biomass is determined by the difference in zooplankton abundance between day and night net 
collections. 
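
The spatial binning described above can be sketched as follows. This is a minimal illustration, assuming along-track retrievals arrive as flat arrays with latitude in [-90, 90] and longitude in [-180, 180]; the function and variable names are hypothetical and the same routine would serve for the 2°, 5° and 6° bins by changing `bin_deg`.

```python
import numpy as np


def bin_mean(lat, lon, values, bin_deg=2.0):
    """Average along-track values into bin_deg x bin_deg lat-lon bins."""
    lat = np.asarray(lat, dtype=float)
    lon = np.asarray(lon, dtype=float)
    values = np.asarray(values, dtype=float)
    n_lat = int(round(180.0 / bin_deg))
    n_lon = int(round(360.0 / bin_deg))
    # Bin indices; clip so that lat = 90 and lon = 180 fall in the last bin.
    i = np.clip(((lat + 90.0) // bin_deg).astype(int), 0, n_lat - 1)
    j = np.clip(((lon + 180.0) // bin_deg).astype(int), 0, n_lon - 1)
    good = np.isfinite(values)
    total = np.zeros((n_lat, n_lon))
    count = np.zeros((n_lat, n_lon))
    # np.add.at accumulates correctly when several retrievals share a bin.
    np.add.at(total, (i[good], j[good]), values[good])
    np.add.at(count, (i[good], j[good]), 1.0)
    with np.errstate(invalid="ignore", divide="ignore"):
        mean = total / count  # NaN where a bin has no retrievals
    return mean, count


# Invented retrievals: two in one bin, one in another.
grid, n = bin_mean([0.5, 1.5, 10.0], [0.5, 1.0, -170.0], [1.0, 3.0, 5.0])
```

Returning the per-bin count alongside the mean makes it easy to see how many retrievals support each bin, which is the trade-off behind choosing coarser bins for higher signal-to-noise.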


No statistical method was used to determine CALIOP sampling, as the frequency of sampling is determined by the hardware of the 
lidar instrument. For the current study, the native resolution of CALIOP data is far greater than the analysis bins used for assessing 
DVM patterns and this high sampling resolution improves the signal-to-noise of the retrieved properties. For ADCP data, no 
statistical method was used to determine sampling frequency as the native resolution of these data is far greater than the single day- 
night differences used to assess the DVM signal. HOT and BATS data were taken from the published literature and the multi-year 
monthly sampling frequency of these data (dictated by predefined program protocols) typically provides approximately 10 or more 
measurements of DVM biomass for each month from which the mean annual cycle was calculated for the current analysis. 


Optical signals were obtained using the CALIOP satellite lidar. ADCP data were collected by a diversity of researchers on historical 
ship transects. HOT and BATS data are previously published, as noted above, and are products of the long-term HOT and BATS 
programs.


CALIOP data were collected between 2008 and 2017 at a native spatial resolution of 330 m sampling distance between retrievals 
along the satellite orbit track and then combined into various sized bins as described above. ADCP data were collected between 
1985 and 2017 and then all available data were globally combined into 5 degree latitude by 5 degree longitude geographic bins.
HOT DVM samples were collected between 1994 and 2005 using a plankton net with a 1 square meter opening at the location 
indicated in figure 3a of the manuscript. BATS data were collected between 1994 and 2017 using a plankton net with a 0.8 m x 1.2 
meter opening at the location indicated in figure 3a of the manuscript. 


Global CALIOP data were initially analyzed and then subsampled for data within the PSO. Higher latitude data were excluded for the 
CALIOP-ADCP comparison in figure 3 because poor sampling coverage and high spatial variability in plankton properties at these 
latitudes resulted in noisy day-night comparison results (see detailed discussion in Supplementary Discussion). CALIOP exclusion 
criteria were not pre-established. ADCP data were subsampled to only include measurement frequencies of 150 or 300 kHz because 
these frequencies better isolate scattering from the small (~0.5-5 mm) animals that are most likely detected by CALIOP. Thus, 
exclusion criteria were pre-established. These ADCP data were then quality controlled as described in the Methods section. All HOT 
and BATS data reported in the Hannides, C.C.S, et al. and Steinberg, D.K., et al. manuscripts cited above were included. 


Validation of the CALIOP results was achieved through comparison with historical global ADCP data and zooplankton net data from 
BATS and HOT. Successful replication was achieved for all regions and months by combining data from multiple years of CALIOP, 
ADCP, and zooplankton net measurements.


The 8 PSO regions evaluated were defined first for the 5 gyres using the stated chlorophyll criterion in the manuscript and then the 3
remaining equatorial regions were defined as the regions spanning between the north and south gyres. In the Pacific a single equatorial
region was defined and in the Atlantic the equatorial region was split into northern and southern sections.


Our study did not require blinding because it did not involve experiments.


Gigantopithecus blacki was a giant hominid that inhabited densely forested
environments of Southeast Asia during the Pleistocene epoch. Its evolutionary
relationships to other great ape species, and the divergence of these species during
the Middle and Late Miocene epoch (16-5.3 million years ago), remain unclear.
Hypotheses regarding the relationships between Gigantopithecus and extinct and
extant hominids are wide ranging but difficult to substantiate because of its highly
derived dentognathic morphology, the absence of cranial and post-cranial
remains, and the lack of independent molecular validation. We retrieved dental
enamel proteome sequences from a 1.9-million-year-old G. blacki molar found in
Chuifeng Cave, China. The thermal age of these protein sequences is approximately
five times greater than that of any previously published mammalian proteome or
genome. We demonstrate that Gigantopithecus is a sister clade to orangutans (genus
Pongo) with a common ancestor about 12-10 million years ago, implying that the
divergence of Gigantopithecus from Pongo forms part of the Miocene radiation of
great apes. In addition, we hypothesize that the expression of
alpha-2-HS-glycoprotein, which has not been previously observed in enamel proteomes, had a
role in the biomineralization of the thick enamel crowns that characterize the large
molars in Gigantopithecus. The survival of an Early Pleistocene dental enamel
proteome in the subtropics further expands the scope of palaeoproteomic analysis
into geographical areas and time periods previously considered incompatible with
the preservation of substantial amounts of genetic information.


G. blacki is an extinct, potentially giant hominid species that once
inhabited Asia. It was first discovered and identified by von Koenigswald
in 1935, when he described an isolated tooth that he found in a Hong Kong
drugstore. The entire G. blacki fossil record, dated between the Early
Pleistocene (about 2 million years ago (Ma)) and the late Middle Pleistocene
(about 0.3 Ma), includes thousands of teeth and four partial mandibles
from subtropical Southeast Asia. All the locations at which G. blacki
remains have been found are in or near southern China, stretching from
Longgupo Cave, just south of the Yangtze River, to the Xinchong Cave
on Hainan Island and, possibly, into northern Vietnam and Thailand.


To address the evolutionary relationships between Gigantopithecus 
and extant hominoids, we performed protein extractions on dentine 
and enamel samples of a single molar (CF-B-16) found in Chuifeng Cave, 
China (Extended Data Figs. 1, 2), that has been morphologically assigned
to G. blacki. The site has been dated, using multiple approaches, to
1.9 ± 0.2 Ma. We processed enamel and dentine samples using recently
established digestion-free protocols that were optimized for extremely
degraded ancient proteomes (see Methods). Enamel demineralization
was replicated using two acids, trifluoroacetic acid (TFA) and
hydrochloric acid (HCl).

Fig. 1 | Thermal age of Chuifeng Cave, China, in the subtropics of Southeast
Asia. Chronological (circles) and thermal (coloured wedges, @10 °C) ages of
other Cenozoic ancient genomes and proteomes are given for comparison. For
Chuifeng Cave, dark red shading indicates thermal age calculated at a
chronological age of 1.9 Ma and lighter shading indicates 95% confidence

We identified no endogenous proteins from the dentine, but 
instead recovered an ancient enamel proteome composed of 409 
unique peptides matching 6 endogenous proteins: amelogenin X 
(AMELX), ameloblastin (AMBN), amelotin (AMTN), enamelin (ENAM),
matrix metalloproteinase-20 (MMP20) and alpha-2-HS-glycoprotein
(AHSG, also known as FETUA; Extended Data Table 1). This observation
extends the survival of ancient mammalian proteins to a thermal
age, obtained by normalizing the chronological age to a constant
temperature of 10 °C, of approximately 11.8 million years (Myr)@10 °C
(Extended Data Table 2). Such a thermal age is well beyond the
thermally oldest DNA (0.25 Myr@10 °C, Sima de los Huesos, Spain),
collagen (0.22 Myr@10 °C, Happisburgh, UK) and enamel proteome
(2.2 Myr@10 °C, Dmanisi, Georgia) reported to date. The Chuifeng
Cave enamel proteome is thus, to our knowledge, the oldest Cenozoic
skeletal proteome currently reported (Fig. 1). The survival of a
subtropical proteome from approximately 2 Ma suggests that chronologically
older specimens from higher latitudes are likely to contain preserved
ancient proteomes as well.

The content of the recovered enamel proteome is consistent with
previously reported ancient enamel proteomes, with the addition
of several peptides deriving from a single region of AHSG. Peptide
matches to these proteins covered a minimum of 43 informative single
amino acid polymorphisms (SAPs; Supplementary Table 3). In addition,
the retrieved protein regions fell largely within areas previously
recovered from an Early Pleistocene Stephanorhinus enamel proteome
from Dmanisi (Supplementary Fig. 1). The absence of peptides specific
to amelogenin Y (AMELY) suggests that the sampled molar might have


intervals based on a chronological age of 1.7 and 2.1 Ma, respectively. Inset,
geographical location of Chuifeng Cave in the subtropics of Southeast Asia (red
asterisk). Base map was generated using public domain data from
http://www.naturalearthdata.com/. The red asterisk also encloses the entire known
geographical range of G. blacki fossils.


belonged to a female Gigantopithecus. Alternatively, male-diagnostic
AMELY-specific peptides may not have been observed owing to their
degradation beyond the limit of detection of the instrument. The endogenous
enamel proteome sequence coverage of 456 amino acids is lower than
the previously recovered sequence coverage for a Dmanisi
Stephanorhinus specimen (875 amino acids; Supplementary Table 1). This
observation is consistent with the older thermal age for Chuifeng Cave,
compared to Dmanisi.

We replicated enamel demineralization using two acids (TFA and 
HCl). The chromatograms of these two extracts showed that different 
peptide populations were released (Extended Data Fig. 3). Owing to the 
partial acidic hydrolysis, which potentially occurs alongside
demineralization, peptide populations with a wider range of acidity (Extended
Data Fig. 4a) and hydrophobicity (Extended Data Fig. 4c) are generated
using TFA. Our TFA-based demineralization returned 127 more
unique non-overlapping peptide sequences than did the HCl-based
demineralization (Extended Data Fig. 4e). The TFA extract, therefore,
outperformed the HCl-based extraction, despite being carried out on
a smaller amount of starting material. Ultimately, the extended coverage
of TFA-based demineralization increased the identification rate of
informative SAPs, enhancing the phylogenetic information obtained
(Extended Data Fig. 4d). Finally, the HCl- and TFA-demineralized
samples had similar deamidation rates and average peptide lengths
(Extended Data Fig. 5), which indicates that the two acids released
peptide populations that were modified to the same extent.

The Gigantopithecus enamel proteome is characterized by extensive
diagenetic modifications, such as high rates of deamidation (Fig. 2a) and

Fig. 2 | Modifications and degradation of Gigantopithecus enamel
proteome. a, Violin plots of asparagine (N) and glutamine (Q) deamidation for
selected proteins (n = 1,000 bootstrap replicates of intensity-based peptide
deamidation). Human dermcidin (DCD) is included as an example of a
non-deamidated contaminant. For AMBN, all observed asparagines and glutamines
are deamidated. For AMELX, all asparagines are deamidated. For DCD, no
observed asparagines and glutamines are deamidated. The numbers of
peptides used for the calculation are shown at the bottom. b, Violin plots of
peptide lengths for Gigantopithecus, an Early Pleistocene rhinoceros from
Dmanisi and a Medieval control sample. c, Sequence-motif analysis of the


a high degree of degradation, as indicated by relatively short peptide 
lengths (Fig. 2b); this is expected for an ancient proteome preserved in 
subtropical conditions. When quantifying peptide intensities using
label-free quantification (LFQ) implemented in MaxQuant, we observed that
summed and normalized MS1 spectral intensities were higher for shorter
peptides than for longer peptides (Extended Data Fig. 4b). Finally, the
peptide lengths of the Chuifeng Cave enamel proteome were shorter
than those identified in thermally younger enamel proteomes (Fig. 2b).
Enamel-specific proteins are modified in vivo through protein
phosphorylation, alternative splicing of AMELX, and MMP20- and
KLK4-mediated proteolysis. Such modifications could survive in ancient
proteomes. We detected evidence of surviving in vivo post-translational
modifications, such as serine phosphorylation in the S-x-E/phS motif,
which is recognized by the secreted kinase FAM20C (Fig. 2c). The FAM20C
kinase regulates the phosphorylation of extracellular proteins involved
in biomineralization. We also observed two alternative-splicing-derived
isoforms of AMELX (Fig. 2d). These observations are similar to those from
other Early Pleistocene enamel proteomes. The Gigantopithecus enamel
proteome therefore demonstrates that such in vivo modifications can
be recovered from hominid samples across the Pleistocene.
To achieve a protein-based phylogenetic placement of Gigantopithecus,
we compared the enamel proteome sequences we retrieved with those
of extant apes (Hominoidea). We used publicly available whole-genome
sequence data to predict enamel protein sequences from relevant
species (Supplementary Table 2, Supplementary Figs. 2-12). Our
results show that Gigantopithecus represents a sister taxon to all extant

over-representation of specific amino acids around the phosphorylated amino
acid (position 0; n = 14). d, Peptide coverage of AMELX protein isoforms.
Matching peptides are indicated by black bars for isoform 1 (n = 21) and isoform
3 (n = 7). The latter includes an insertion due to alternative splicing between
isoform 1, amino acid positions 34 and 35 (coordinates in reference to UniProt
Accession number: Q99217-1 [AMELX_HUMAN]). b includes data on AMELX,
AMBN, ENAM, AMTN and MMP20 only. For a and b, box plots define the range
of the data (whiskers extending to 1.5× the interquartile range), 25th and 75th
percentiles (boxes) and medians (dots/lines). For a, outliers are indicated as
black dots (beyond 1.5× the interquartile range).


orangutans (Pongo spp.) and forms a monophyletic group with extant 
pongines (Fig. 3a, Extended Data Figs. 6, 7). We then attempted to estimate 
the divergence time between Gigantopithecus and Pongo species using two 
approaches: (1) a pairwise distance approach and (2) a Bayesian approach 
using MrBayes (see Methods). Although confidence intervals obtained 
for the divergence estimates of the Pongo-Gigantopithecus split are large,
our results indicate that Gigantopithecus diverged from the extant Pongo
species in the Middle or Late Miocene (approximately 10 Ma or 12 Ma using
the Bayesian or pairwise distance approaches, respectively; Fig. 3b). This
suggests that, despite an exclusively Pleistocene fossil record,
Gigantopithecus is a member of an early radiation of pongines, whose diversity
peaked during the Middle and Late Miocene (Fig. 3b). Our results thereby
not only resolve the phylogenetic position of Gigantopithecus, but also
renew the debate on the evolutionary relationships between extant
hominids and early hominids present in the fossil record.

The presence of AHSG in the Gigantopithecus proteome is intrigu- 
ing, as this protein is not commonly observed in (modern) hominid 
enamel proteomes. All retrieved peptides derive froma single, highly 
conserved region that is bordered by disulfide cysteine bonds on 
either side (Extended Data Fig. 8). AHSG is highly glycosylated in vivo, 
but we observed no glycosylation during our bioinformatics analysis. 
The observed sequence contains regularly spaced aspartic acid residues 
that provide a suitable motif for binding to basic calcium phosphate 
lattices”’. The notion that this specific peptide sequence is involved in 
biomineral binding is supported by three observations: (1) this region 
is presented on the external surface of AHSG’S; (2) such surfaces have 
Fig. 3 | Bayesian phylogeny and divergence of Gigantopithecus among 
extant apes (Hominoidea). a, Time-calibrated Bayesian phylogeny of 
Gigantopithecus. Circled nodes were fixed for topology (see Methods). Grey 
error bars represent the 95% highest posterior density (HPD) intervals for the 
divergence estimates. b, Distribution of probable and possible extinct 
pongines known from the fossil record, including Gigantopithecus (black bars). 
Grey bars represent the 95% HPD interval obtained from the Bayesian approach 
and the 95% confidence interval obtained from the pairwise-distance-based 
approach, of the Gigantopithecus-Pongo divergence. 


been demonstrated to bind biominerals in other systems as well’; and 
(3) this type of binding enhances peptide preservation”. AHSG acts 
as a key component of bone and dentine mineralization by inhibiting 
the extrafibrillar mineralization of collagen type I helices”° and has 
previously been hypothesized to have a role in amelogenesis’. Our 
extracts contained no endogenous plasma proteins, such as human 
serum albumin, or other common dentine proteins, such as collagen 
type I. We also did not identify any AHSG peptides in our dentine sample. 
We therefore exclude the possibility that the AHSG peptides derived 
from dentine. Gigantopithecus is known to have relatively long enamel 
formation times and thick enamel compared to several extant and 
extinct hominids, including its phylogenetically closest relatives!®*). 
We therefore hypothesize that Gigantopithecus recruited AHSG as an 
additional molecular component to favour enamel biomineralization 
during prolonged amelogenesis, ultimately playing a part comparable 
to the one it has in bone and dentine mineralization’. 

Our results reveal the long-debated phylogenetic position of Gigan- 
topithecus as an early diverging pongine. We demonstrate that ancient 
enamel proteomes can be retrieved from Early Pleistocene samples 
preserved in subtropical conditions, well beyond the current limita- 
tions of biomolecular research in hominid and hominin evolution. In 
addition, the survival of an Early Pleistocene Gigantopithecus enamel 
proteome enables us to assess the presence of multiple forms of in vivo 
modification. Finally, palaeoproteomic analysis allowed us to identify 
a hitherto unknown biological component of tooth formation in an 
extinct hominid. These findings suggest that the palaeoproteomic 
analysis of hominid enamel has the potential to provide a molecular 
perspective on human and great ape evolution. 
Methods 


Chuifeng Cave 

The Chuifeng Cave (23° 34’ 27” N, 107° 00’ 22” E) is one of the most 
representative sites for the Early Pleistocene G. blacki fauna®. The site 
is located in the Bubing Basin in the northwestern part of the Guangxi 
Zhuang Autonomous Region, south China (Extended Data Fig. 1). The 
cave is 19 m long, 0.5-2 m wide and 1.5-5 m high, penetrating the limestone 
from southeast to northwest at a height of ~77 m above the local 
valley floor. A fossiliferous sandy clay with a few limestone breccias 
fills most of the cave, with an average depth of 1.3 m (Extended Data 
Fig. 2). Four excavation areas (A, B, C and D) were excavated down to 
limestone bedrock at 10-cm intervals. Twenty-four large mammalian 
species, including 92 G. blacki teeth, were unearthed from the cave®. The 
Chuifeng Cave mammalian fauna is characterized by the occurrence 
of typical Early Pleistocene species, such as Hystrix magna, Sinomas- 
todon sp., Stegodon preorientalis, Ailuropoda microta, Pachycrocuta 
licenti, Tapirus sanyuanensis and Sus peii®. This mammalian fauna 
is comparable with other Gigantopithecus-containing faunas of the 
Early Pleistocene in southern China, such as Baikong™®, Longgupo™* 
and Liucheng®. The mammalian fauna composition is consistent with 
the results of combined electron spin resonance (ESR)/U-series dating 
and sediment palaeomagnetic studies (~1.9 Ma)*°. In the present study, 
we collected one well-preserved G. blacki tooth (excavation number 
CF-B-16) for palaeoproteomic analysis. This tooth was excavated from 
area B at a depth of 90 cm from the sediment surface and, based on 
its stratigraphic position, dated to ~1.9 Ma. No other specimens were 
tested before CF-B-16, and no specific selection was made as to which 
Gigantopithecus tooth would be analysed. 


Thermal age 

Thermal age was calculated to allow comparison with previously pub- 
lished ancient genomes, ancient proteomes and collagen peptide mass 
fingerprinting studies, from other temporal and geographical locali- 
ties. Temperature estimates for the hominin occupation of Dmanisi 
based on herpetological fauna suggest a temperature about 3.1°C 
above the current MAT”, while the sea surface temperature record 
used” predicts a negative ΔT at the time of hominin occupation. Given 
this discrepancy and the widely different temperature estimates for the 
Last Glacial Maximum (LGM) in the Caucasus, we conservatively use 
a scale factor of 0, correlating with a ΔT of approximately −0.2 °C and 
a current MAT of 11.2 °C. Our thermal age prediction for Dmanisi 
(2.2 Myr@10 °C) should therefore be seen as conservative. The thermal 
age for Chuifeng Cave was calculated with a general lapse rate between 
mean annual temperature (MAT) and altitude of 5.0 °C/km, a scale factor 
of 0.7 and a ΔT at LGM of −3 °C. The actual ΔT at LGM might have 
been more pronounced, again leading to a conservative estimate for 
the thermal age of Chuifeng Cave. MAT was estimated based on the 
ten closest weather stations listed in publicly accessible World Mete- 
orological Organization (WMO) data (Extended Data Table 2). Thermal 
age calculations are, among other factors, altitude dependent, but only 
five out of these ten weather stations have altitudes directly associated 
with them. We therefore estimated the altitude of the other five weather 
stations through an online resource (https://www.advancedconverter.com/map-tools/find-altitude-by-coordinates). The correlation between 
WMO altitude and estimated altitude was R² = 0.99, providing sufficient 
validity to our estimated altitudes. The MATs for all weather stations 
were then averaged to obtain an approximate MAT for Chuifeng Cave. 
Next, thermal age was calculated for chronological ages of 1.7 Myr, 1.9 
Myr and 2.1 Myr, giving estimates of the minimum (9.2 Myr@10 °C), 
maximum (15.0 Myr@10 °C) and mean (11.8 Myr@10 °C) thermal ages 
associated with the Chuifeng Cave fauna within a 95% confidence inter- 
val (Fig. 1). The Chuifeng Cave proteome is therefore, to our knowledge, 
substantially older than the oldest collagen peptide mass fingerprint 
(Ellesmere Island, 0.003 Myr@10 °C), oldest mammalian genome 


(Thistle Creek, 0.03 Myr@10 °C), oldest hominin genome (Sima de 
los Huesos, 0.25 Myr@10 °C) and oldest enamel proteome (Dmanisi, 
2.2 Myr@10 °C) published to date”. Full thermal age calculations can 
be found in Supplementary Data 2. 
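
The altitude check and the MAT averaging described above can be sketched in a few lines of Python. This is an illustration only, not the authors' calculation: the altitude pairs and the per-station cave MAT values are copied from Extended Data Table 2, the function names are hypothetical, and it is an assumption that the averaged quantity is the 'Chuifeng Cave MAT' column of that table.

```python
from math import sqrt

# (WMO altitude, estimated altitude) in metres for the five stations in
# Extended Data Table 2 that report both values.
ALTITUDE_PAIRS = [(740, 743), (137, 114), (305, 245), (76, 81), (13, 10)]

# 'Chuifeng Cave MAT' column of Extended Data Table 2, one value per station.
CAVE_MAT_ESTIMATES = [22.8, 22.1, 20.0, 20.0, 22.0, 22.2, 24.5, 22.8, 21.2, 21.1]


def pearson_r_squared(pairs):
    """Return the squared Pearson correlation coefficient of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    s_xx = sum((x - mean_x) ** 2 for x, _ in pairs)
    s_yy = sum((y - mean_y) ** 2 for _, y in pairs)
    r = s_xy / sqrt(s_xx * s_yy)
    return r * r


def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


if __name__ == "__main__":
    print("R^2 (WMO vs estimated altitude):", round(pearson_r_squared(ALTITUDE_PAIRS), 2))
    print("Mean of per-station cave MAT estimates:", round(mean(CAVE_MAT_ESTIMATES), 2))
```

Only five stations carry both altitude values, so the correlation rests on a small sample; the sketch simply makes that check reproducible from the table.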


Protein extraction 

Extraction of ancient proteins took place in facilities at the Natural 
History Museum of Denmark dedicated to extracting ancient DNA and 
ancient proteins. These laboratories include clean rooms fitted with 
filtered ventilation and positive air pressure®’. A negative extraction 
blank was processed alongside the ancient extractions, with the addi- 
tional inclusion of injection blanks during tandem mass spectrometry 
(MS/MS) analysis to monitor potential protein contamination during 
all stages of analysis. 

Two samples of enamel (185 and 118 mg) and one of dentine (192 mg) 
were removed from the same molar (CF-B-16) using a sterilized drill 
and crushed to a rough powder. One enamel sample and the dentine 
sample were demineralized in 1.2 M HCl at 3 °C for 24 h, and the other 
enamel sample was demineralized at the same temperature and dura- 
tion using 10% TFA. Subsequently, solubilized protein residues were 
cleaned, concentrated and immobilized on C18 Stage-Tips using previ- 
ously published methods”. No other samples from Chuifeng Cave were 
analysed before or during the analysis of CF-B-16. 


LC-MS/MS analysis 

The extracts were analysed by nanoflow liquid chromatography-tandem 
mass spectrometry (nanoLC-MS/MS) using a 15-cm capillary column 
(75 µm inner diameter, packed with 1.9 µm C18 beads (Reprosil-AQ 
Pur, Dr. Maisch)) on an EASY-nLC 1200 system (Proxeon) connected 
to a Q-Exactive HF-X mass spectrometer (Thermo Scientific). The 
nanoLC gradient and MS parameters followed a previously published 
Q-Exactive HF-X method*. System wash blanks were performed before 
and after every sample to hinder cross-contamination. 


Database construction 

We constructed a protein sequence database for Hominoidea proteins 
known to be present in enamel proteomes (Supplementary Table 2), 
to which we added the homologous sequences from one Cercopithe- 
coid (Macaca mulatta) as an outgroup for phylogenetic analysis. As 
few protein sequences are publicly available for Pongo pygmaeus, we 
predicted those sequences from publicly available genomic sequence 
data using the known gene coordinates of Pongo abelii homologues. 
Similarly, we generated de novo AMELY sequences for P. abelii and 
P. pygmaeus. Finally, we added common laboratory contaminants to 
allow spectra from such proteins to be confidently identified**. 


Ancestral sequence reconstruction. Previous research indicates 
that cross-species proteomic effects, observed during spectral 
identification, substantially reduce the identification of phyloge- 
netically informative amino acid positions at large evolutionary 
distances’. We reasoned that this was likely to occur in the case of 
Gigantopithecus proteins“, and therefore reconstructed the ancestral 
protein sequences of enamel-specific proteins. Ancestral sequence 
reconstruction (ASR) was conducted across the entire Hominoidea 
phylogeny using PhyloBot*. Input sequences were constrained 
phylogenetically to (Macaca,(Nomascus,((Pongo abelii, Pongo 
pygmaeus),Gorilla,(Homo,((Pan paniscus, Pan troglodytes)))))). We 
added those sequences to the reference protein database to account 
for them in the database search of PEAKS and MaxQuant. 


Isoform variation. After obtaining complete protein sequences for all 
extant hominids, we added isoforms not present in UniProt or GenBank 
for the proteins AMELX, AMELY, AMBN, AMTN, KLK4 and TUFT1, including 
the reconstructed ASR sequences of these proteins, to the database. 
We assumed that the isoforms for these non-human hominids would 


result from identically placed alternative splicing across species and 
ancestral nodes (as also supported by all UniProt isoforms present 
for the studied proteins). Thus, we copied these alternative splicing 
sites onto the available reference sequences to create the missing iso- 
forms. Database sequence names for these proteins were appended 
with ‘ManIso2’ or ‘ManIso3’. 


Proteomic data analysis 

Raw mass spectrometry data were searched per sample type (enamel, 
dentine, extraction blank and injection blanks) against a sequence 
database containing all common enamel proteins for all extant homi- 
nids (see above). We used PEAKS“? (v.7.5) and MaxQuant” (v.1.6.2.6) 
software. The de novo and error-tolerant implementations of PEAKS, 
and the dependent peptide algorithm implemented in MaxQuant, 
were used to generate possible, additional, SAP variation in enamel 
protein sequences. Such novel SAPs could represent unique amino 
acid substitutions on the Gigantopithecus lineage, which are not rel- 
evant to its phylogenetic placement but are relevant for dating the 
Pongo-Gigantopithecus divergence. Next, these potential sequence 
variants were added to a newly constructed sequence database and 
verified in separate searches in PEAKS and MaxQuant. We defined 
as variable modifications methionine oxidation, proline hydroxyla- 
tion, glutamine and asparagine deamidation, pyro-glutamic acid 
from glutamic acid, pyro-glutamic acid from glutamine and phos- 
phorylation (STY). No fixed modifications were selected. We did not 
use an enzymatic protease during sample preparation, therefore the 
digestion mode was set to ‘unspecific’. For PEAKS, peptide spectrum 
matches were only accepted with a false-discovery rate (FDR) <1.0%, 
and precursor mass tolerance was set to 10 ppm and fragment mass 
tolerance to 0.05 Da. For MaxQuant, peptide spectrum matches and 
protein FDR were set at <1.0%, with a minimum Andromeda score of 
40 for all peptides. Protein matches were accepted with a minimum 
of two unique peptide sequences in at least one of the MaxQuant or 
PEAKS searches, including the removal of non-specific peptides after 
BLASTp searches of peptides matching non-enamel proteins against 
UniProt and GenBank databases. Proteins that are retained after apply- 
ing these criteria are listed in Extended Data Table 1. Examples of 
annotated MS/MS spectra after MaxQuant analysis can be found in 
Supplementary Figs. 3-12. 
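
The protein acceptance rule stated above (at least two unique peptide sequences in at least one of the two searches) can be written as a one-line predicate. The sketch below is a hypothetical illustration; the function name and the example counts are assumptions, and it does not cover the BLASTp-based removal of non-specific peptides.

```python
def accept_protein(maxquant_unique, peaks_unique, minimum=2):
    """Accept a protein if either search engine reports enough unique peptides.

    maxquant_unique, peaks_unique: number of unique peptide sequences matched
    to the protein in the MaxQuant and PEAKS searches, respectively.
    """
    return maxquant_unique >= minimum or peaks_unique >= minimum


if __name__ == "__main__":
    # Illustrative counts only, not values from the study.
    for counts in [(2, 1), (1, 1), (0, 3)]:
        print(counts, accept_protein(*counts))
```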

Assessment of protein damage and degradation followed protocols 
explained elsewhere’? and included rates of deamidation and a comparison 
of observed peptide lengths. GRAVY index scores of peptide 
hydrophobicity were calculated using the R package Peptides, with 
the scale set to ‘KyteDoolittle’. 
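
For readers without R, the GRAVY index can be approximated in Python as the mean Kyte-Doolittle hydropathy of a peptide's residues. This is a minimal sketch and not the R package Peptides itself; the function name is hypothetical, and it assumes peptides contain only the 20 standard amino acids.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(peptide):
    """Return the grand average of hydropathy (GRAVY) of a peptide.

    Raises KeyError for non-standard residues and ValueError for empty input.
    """
    if not peptide:
        raise ValueError("peptide must not be empty")
    return sum(KYTE_DOOLITTLE[residue] for residue in peptide.upper()) / len(peptide)


if __name__ == "__main__":
    # Positive scores indicate hydrophobic peptides, negative scores hydrophilic ones.
    print(gravy("IL"), gravy("DE"))
```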


Phylogenetic and divergence analysis 

Comparative reference dataset. We assembled a reference dataset 
with five protein sequences retrieved from the ancient sample (AMBN, 
AMELX, AMTN, ENAM and MMP20) and relevant extant species (Sup- 
plementary Table 2). Protein sequences for human (Homo sapiens), 
common chimpanzee (P. troglodytes), bonobo (P. paniscus), Sumatran 
orangutan (P. abelii), Western gorilla (Gorilla gorilla), rhesus macaque 
(M. mulatta) and the white-cheeked gibbon (Nomascus leucogenys) 
were obtained from the UniProt database. Additionally, we expanded 
our dataset with protein sequences from publicly available whole- 
genome sequence data from present-day great apes (in total 27 oran- 
gutans, 42 gorillas, 11 bonobos and 61 chimpanzees”), as well as 19 
human individuals from the Simons Genome Diversity Project (SGDP)*. 
See Supplementary Information for the human sample numbers taken 
from the SGDP dataset. 


Reconstruction of protein sequences from whole-genome 
sequencing data. DNA sequence reads for reference samples used 
were mapped to the human genome (version hg19) using BWA-MEM 
v.0.7.5a-1405 (http://bio-bwa.sourceforge.net/bwa.shtml) with default 


parameters. PCR and optical duplicates were identified and removed 
using PICARD v.1.91 (https://sourceforge.net/projects/picard/files/picard-tools/1.91/). Single-nucleotide polymorphisms were called on the 
read alignments using the GATK UnifiedGenotyper (https://software.broadinstitute.org/gatk/documentation/tooldocs/current/org_broadinstitute_gatk_tools_walkers_genotyper_UnifiedGenotyper.php). 

To reconstruct the protein sequences from the genotype calls, we 
first created a consensus sequence for each of the five genes of interest 
and for each sample. Indels were not considered and a random allele 
was chosen at heterozygous positions. Next, we removed the intron 
sequences from each gene using the annotation of the reference human 
genome (hg19) available in the ENSEMBL database. For each of the in 
silico spliced genes, we performed a tblastn search” using the human 
reference protein as the query. Finally, we obtained the translated pro- 
tein sequences from the resulting alignments. 
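
The first two steps of this reconstruction (building a per-sample consensus with a random allele at heterozygous sites, then removing introns) might look roughly as follows. This is a simplified sketch under stated assumptions: positions are 0-based, exons are half-open (start, end) intervals, genotypes are given as allele pairs, and all function names are hypothetical. It omits indel handling, as the text does, and does not reproduce the tblastn-based translation step.

```python
import random


def consensus_sequence(reference, genotypes, rng):
    """Apply genotype calls to a reference sequence.

    genotypes maps a 0-based position to a pair of alleles; at heterozygous
    positions one of the two alleles is chosen at random.
    """
    bases = list(reference)
    for position, alleles in genotypes.items():
        bases[position] = rng.choice(alleles)
    return "".join(bases)


def splice_exons(sequence, exons):
    """Concatenate exon intervals (0-based, half-open), dropping introns."""
    return "".join(sequence[start:end] for start, end in exons)


if __name__ == "__main__":
    rng = random.Random(42)  # fixed seed so the random allele choice is repeatable
    gene = consensus_sequence("AACCGGTT", {1: ("A", "G"), 4: ("T", "T")}, rng)
    print(splice_exons(gene, [(0, 2), (4, 6)]))
```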


Assessing the phylogenetic position of G. blacki. We compared the 
G. blacki protein sequences with the corresponding homologues of 
the species in the reference panel. For each gene, we built two multiple 
sequence alignments using mafft”. The first incorporated all samples 
in the reference panel (n=164). The second incorporated only a single 
sample per species (Supplementary Table 3). To account for isobaric 
amino acids (leucines and isoleucines), which cannot be distinguished 
in the ancient protein data, we changed all isoleucines to leucines at 
positions where the ancient sample carried either of those amino acids. 
To assess the phylogenetic position of the ancient sample, two inference 
approaches were used: a maximum-likelihood and a Bayesian inference. 
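
The leucine/isoleucine masking step described above can be sketched as follows. This is an illustrative Python version, not the authors' script: the alignment is assumed to be a dictionary of equal-length aligned sequences, and the function and sample names are hypothetical.

```python
def mask_isobaric(alignment, ancient_id):
    """Recode I as L in every sequence at columns where the ancient sample has I or L.

    alignment: dict mapping sequence name to an aligned amino acid string.
    ancient_id: key of the ancient sample within the alignment.
    """
    ancient = alignment[ancient_id]
    # Columns where the ancient sample carries either isobaric residue.
    columns = {i for i, residue in enumerate(ancient) if residue in "IL"}
    masked = {}
    for name, sequence in alignment.items():
        masked[name] = "".join(
            "L" if (i in columns and residue == "I") else residue
            for i, residue in enumerate(sequence)
        )
    return masked


if __name__ == "__main__":
    toy = {"ancient": "AIK-L", "pongo": "AIKIL", "homo": "ALKII"}
    for name, sequence in mask_isobaric(toy, "ancient").items():
        print(name, sequence)
```

Columns where the ancient sample has a gap or another residue are left untouched, so isoleucine/leucine differences among extant species remain informative there.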

Maximum-likelihood approach. PhyML v.3.1*° was used to infer a 
maximum-likelihood tree, branch lengths and substitution rates for 
each individual protein alignment (Supplementary Fig. 2) and for the 
concatenated alignment. For each alignment, we started from three 
random trees (--n_rand_starts 3 -s BEST --rand_start), used the JTT 
model (-m JTT -f m), and obtained maximum likelihood estimates for 
the gamma distribution shape parameter (-a e) and the proportion of 
invariable sites (-v e). Support values were obtained for each biparti- 
tion based on 100 non-parametric bootstrap replicates. The bootstrap 
results per branch split are shown in Extended Data Fig. 6b. 

Bayesian approach. As a complementary approach, we used 
MrBayes” and the concatenated alignment to infer the phylogenetic 
position of the ancient sample (Fig. 3, Extended Data Fig. 6b). We set 
an independent bipartition for each gene and estimated: substitu- 
tion rates, across-site rate variation and the proportion of invariable 
sites (unlink Statefreq = (all) Ratemultiplier = (all) Aamodel = (all) 
Shape = (all) Pinvar = (all)). MrBayes was executed using the CIPRES 
portal’°. The MCMC algorithm was set to 5,000,000 cycles with 4 chains 
and a temperature parameter of 0.2. The convergence of the algorithm 
was assessed using Tracer v.1.6.0 after discarding 25% of the iterations 
as burn-in. MrBayes was run against the reference sequence for each 
species (Extended Data Fig. 6c) or against 162 great ape individuals, 
one hylobatid and one cercopithecid (Extended Data Fig. 7). Both of 
these analyses, as well as the PhyML maximum likelihood approach, 
resulted in the same topology. The analysis using a large number of 
individuals shows, however, that resolution within the genus Pongo is 
limited (Extended Data Fig. 7). Nevertheless, the placement of Gigan- 
topithecus is fully supported. 


Divergence time of Gigantopithecus. We estimated the divergence 
time between Gigantopithecus and the Pongo branch first by using a 
distance-based approach. We used the alignment of the amino acid 
sequences of reference genome sequences for each species as well 
as diversity data (see above). A distance matrix was created from the 
concatenated protein sequences of all individuals using the function 
dist.ml from the R package phangorn™ under the LG amino acid sub- 
stitution model”. We used pairwise exclusion to increase the amount 
of data for the present-day branches. We then calculated the mean 
difference of all orangutan sequences to all sequences from Homo, 
Pan and Gorilla, and the mean difference of all orangutan sequences to 
Gigantopithecus (Extended Data Fig. 6a). We used the average distance 
between orangutan and the other extant great apes as a scaling factor, 
assuming a divergence time between these branches® of 23.8 Ma. Under 
this assumption, the molecular divergence of Gigantopithecus from the 
Pongo branch is 9.98 Ma. However, because Chuifeng Cave is dated to 
1.9 Ma, this branch is likely to be underestimated and its age needs to be 
corrected to 11.88 Ma (9.98 Ma + 1.9 Ma). We combine the 95% confidence interval of the 
distance matrix with the 95% confidence interval of the mutation rate 
estimate, and add the upper and lower values of the 95% confidence 
interval for the Chuifeng Cave dating (1.7-2.1 Ma), and therefore suggest 
conservative lower and upper boundaries for the divergence of 
8.91 and 15.65 Ma, respectively. 
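
The distance-based estimate amounts to a linear rescaling followed by an age correction, which can be sketched as below. This is a hedged illustration: the function names are hypothetical, the example distances are placeholders rather than values from the study, and only the 23.8 Ma calibration, the 9.98 Ma molecular divergence and the 1.9 Ma sample age come from the text.

```python
CALIBRATION_MA = 23.8   # assumed Pongo vs other great apes divergence (from the text)
SAMPLE_AGE_MA = 1.9     # age of the Chuifeng Cave specimen (from the text)


def scaled_divergence(target_distance, calibration_distance, calibration_ma=CALIBRATION_MA):
    """Scale a pairwise distance to time using a calibrated reference distance."""
    return target_distance / calibration_distance * calibration_ma


def correct_for_sample_age(molecular_divergence_ma, sample_age_ma=SAMPLE_AGE_MA):
    """Add the fossil's age, since its lineage stopped accumulating changes at death."""
    return molecular_divergence_ma + sample_age_ma


if __name__ == "__main__":
    # Placeholder distances: a target distance half the calibration distance.
    print(scaled_divergence(0.5, 1.0))
    # Values reported in the text: 9.98 Ma corrected by the 1.9 Ma sample age.
    print(round(correct_for_sample_age(9.98), 2))
```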

If mutation rates did not substantially differ between extant Pongo 
and Gigantopithecus, this estimate should reflect the molecular evolu- 
tion of their common branch. We calculated the divergence between 
the other great apes, taking into account the mutation rate differences 
on these lineages as scaling factors*’. The resulting divergence time 
between Gorilla and the Homo/Pan branch is estimated at 10.27 Ma 
(7.9-13.25 Ma, 95% confidence interval), and the divergence between 
Homo and Pan at 8.72 Ma (8.06-13.81 Ma, 95% confidence interval). 
These values are in strong agreement with previous estimates”, sug- 
gesting that these protein sequences represent well the known phylog- 
eny of the great apes. Clearly, all divergence time estimates scale with 
assumptions of mutation rates. We also caution that the small number 
of mutations in the peptide fragments in Gigantopithecus constitutes 
a severe limitation on the precision of these estimates on this branch. 
However, the phylogenetic position of Gigantopithecus as a sister clade 
to orangutans is also well supported in this analysis: a phylogenetic tree 
from a distance matrix of the reference sequences for these species 
(neighbour joining tree in phangorn; maximum likelihood computed 
with the pml function; 1,000 bootstrap replicates) separates Giganto- 
pithecus from orangutans with 100% bootstrap support. 

We used the program MrBayes” to obtain divergence time estimates 
in a Bayesian framework using the reference genome sequences. 
We defined M. mulatta as outgroup, grouped Pan, Homo and Gorilla 
together as well as Pongo and Gigantopithecus, and set the divergence 
time of the two groups with a uniform distribution of 17.739-26.061 
Ma, using a previously published estimate”. Furthermore, we set the 
divergence time of the macaques and apes at 26.061-39.9 Ma (from the 
maximum divergence time of the hominids to a very high divergence 
time of the apes). We used a variable mutation rate and the VT amino 
acid substitution model* in five million iterations. This results in a 
divergence time for Gigantopithecus—Pongo of 10.14 Ma (4.76-15.79 
Ma, 95% HPD interval). The divergence of Gorilla from the Homo/Pan 
branch is estimated at 8.59 Ma (4.62-13.56 Ma, 95% HPD interval), and 
the divergence of Homo and Pan at 5.78 Ma (2.64-9.53 Ma, 95% HPD 
interval). These are largely consistent with, but slightly younger than, 
previous estimates», possibly owing to a mutation slowdown in these 
lineages compared to the Pongo lineage, which is not taken into account 
here. However, they seem in agreement with the fossil record indicating 
the origin of hominins around 6-8 Ma and the dating of a possible early 
Gorillini (Chororapithecus) around 7-9 Ma>’*°. Therefore, we conclude 
that the relative branch lengths of the tree (Fig. 3b) are concordant with 
the overall phylogeny and the estimates presented above. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All the mass spectrometry proteomics data have been deposited 
in the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository with the 
dataset identifier PXD013838. Generated ancient protein consensus 
sequences for G. blacki can be found in Supplementary Data 1. Full 
thermal age calculations can be found in Supplementary Data 2. 
Extended Data Fig. 1 | The current environment of Chuifeng Cave. 
a, Landscape outside Chuifeng Cave. b, Elevated altitude of Chuifeng Cave 
(arrow points to the entrance). Photo credit: W.W. 


Extended Data Fig. 2 | Excavations in Chuifeng Cave. a, Main entrance of Chuifeng Cave. b, Well-preserved deposits before excavation. c, The stratigraphic 
profile (1.3 m in height) of area D. d, W.W. excavating in area D. e, Excavated channel. f, g, In situ G. blacki teeth (scale bars, 3 cm). Photo credit: W.W. 
Extended Data Fig. 3 | Total ion current chromatograms of the analysed samples. a, HCl extract. b, TFA extract. Note differences in maximum total ion currents 
on the y axes. Each extract was analysed only once. 
Extended Data Fig. 4 | Comparison of HCl and TFA extraction protocols. 

a, Summed and normalized peptide intensities for each combination of 
peptide length and number of acidic residues (aspartic acid, glutamic acid, 
deamidated asparagine, deamidated glutamine). Circle sizes are proportional 
to the percentage of the total intensity, for each combination of peptide length 
and number of acidic residues. b, Summed and normalized intensities by 
peptide length. c, Summed and normalized peptide intensities across peptide 
hydrophobicity (GRAVY index values calculated using the R package Peptides, 
scale ‘KyteDoolittle’). Insets show peptide count distribution across peptide 
hydrophobicity. d, Extraction performance for various data categories. Values 
scaled to one and compared to the best-performing extraction method for 
each category independently. SAPs refer to those SAPs informative within 
Hominoidea. e, Proportional Venn diagram of unique peptide sequences 
identified in the two demineralization methods. All comparisons based on 
MaxQuant LFQ data only. N, number of peptides. 


[Extended Data Fig. 5 graphic: panel a compares the HCl and TFA extracts; panels b and c show values for AMBN, AMELX, ENAM and DCD, with endogenous enamel-specific proteins labelled.] 


Extended Data Fig. 5 | Damage characteristics of HCl and TFA extraction 
protocols. a, Comparison of peptide lengths, showing an identical distribution 
for the TFA (n=305) and HCl (n=191) extractions (two-sided t-test, t35,=−0.599, 
P=0.5495). b, Comparison of asparagine deamidation. c, Comparison of 
glutamine deamidation. b, c, Violin plots describe the distribution of bootstrap 
replicates (n=1,000) of intensity-based peptide deamidation®”. The number of 
peptides used for the calculation are shown at the bottom. For some proteins, 
only deamidated asparagines or glutamines were observed (for example, 
AMBN), while DCD is included as an example of a non-deamidated 
contaminant. All comparisons based on MaxQuant data only. For a-c, box plots 
define the range of the data (whiskers extending to 1.5× the interquartile 
range), outliers (beyond 1.5× the interquartile range), 25th and 75th percentiles 
(boxes) and medians (dots). 
Extended Data Fig. 6 | Unconstrained phylogenetic analysis of G. blacki. 

a, Pairwise distances between groups of selected Hominoids and Pongo 
estimated using the concatenated protein alignments and the phangorn R 
package. n shows number of pairwise comparisons. b, Maximum-likelihood 
tree computed on a distance matrix using the pml R function. Support values were 
obtained from 1,000 bootstrap replicates. c, Rooted phylogenetic tree 
obtained using MrBayes. For each bipartition, we show the posterior 
probability (0-1) obtained from the Bayesian approach and the support values 
obtained from 100 non-parametric bootstrap replicates in a PhyML maximum-likelihood 
(0-100) tree. PhyML and MrBayes recover the same topology. b and 
c are based on the same concatenated alignment of the five proteins retrieved 
from Gigantopithecus, and resulted in the same tree topology. 
Extended Data Table 1 | Enamel proteome sequence coverage 


MaxQuant MaxQuant PEAKS PEAKS Combined 
Protein Primary entry Protein accession peptides (all amino acids peptides (all amino acids sequence coverage 
unique unique % 
AMELX H2PUX0_PONAB H2PUX0 149 135 (4) 270 141 (10) 70.7 
AMBN H2PDI5_PONAB H2PDI5 55 105 (15) 79 107 (11) 27.5 
AMTN H2PDI4_PONAB H2PDI4 2 18 (0) 2 18 (0) 8.6 
ENAM H2PDI6_PONAB H2PDI6 125 129 (5) 189 181 (57) 16.3 
MMP20 H2NF32_PONAB H2NF32 2 9 (0) 1 9 (0) 1.9 
AHSG H2PC98_PONAB H2PC98 7 13 (0) 12 13 (0) 3.5 
ALB ALBU_Bovin P02769 2 
DCD DCD_human P81605 3 8 
B2MG B2MG_human P61769 2 
K1C9 K1C9_human P35527 3 


Only proteins with two unique peptides in at least either MaxQuant or PEAKS searches were accepted. No protein matches in the dentine or blank extractions fulfilled this criterion. Primary 
entries for proteins used for phylogenetic reconstruction refer to the P. abelii accessions in UniProt for reference purposes. Protein sequence coverage in the final column indicates the coverage 
obtained after combining PEAKS and MaxQuant peptide recovery and is only reported for proteins considered endogenous. ALB, DCD and K1C9 are considered to be contaminants. For amino 
acid columns, numbers in brackets refer to amino acid positions uniquely identified in PEAKS or MaxQuant searches. 


Extended Data Table 2 | MAT estimation at Chuifeng Cave 


WMO station ID | WMO station name | Longitude | Latitude | WMO altitude (m) | Estimated altitude (m) | First month | Last month | MAT (°C) | Altitude source used | Chuifeng Cave MAT (°C) 
5920901 | Ching His | 106.42E | 23.13N | 740 | 743 | February 1981 | October 1990 | 19.5 | WMO | 22.8 
5698505 | Hekou | 103.95E | 22.5N | 137 | 114 | January 1961 | December 1970 | 22.6 | WMO | 22.1 
5791601 | Tien-O | 107.17E | 25.0N | 305 | 245 | January 1981 | October 1990 | 19.7 | WMO | 20.0 
5904602 | Ta Wan | 109.42E | 23.85N | 76 | 81 | January 1981 | October 1990 | 20.8 | WMO | 20.0 
5963201 | Tung Hsing | 107.97E | 21.55N | 13 | 10 | February 1981 | October 1990 | 23.0 | WMO | 22.0 
5921100 | Bose | 106.6E | 23.9N | na | 154 | January 1961 | October 1990 | 22.5 | Estimated altitude | 22.2 
5920901 | Napo | 105.95E | 23.3N | na | 1214 | January 1981 | October 1990 | 19.6 | Estimated altitude | 24.5 
5900700 | Guangnan | 105.07E | 24.07N | na | 1257 | January 1981 | October 1990 | 17.6 | Estimated altitude | 22.8 
5943100 | Nanning | 108.35E | 22.82N | na | 81 | January 1922 | November 1993 | 22.0 | Estimated altitude | 21.2 
5902300 | Hechi | 108.05E | 24.7N | na | 204 | January 1981 | October 1990 | 21.2 | Estimated altitude | 21.1 


The ten geographically closest meteorological weather stations included in publicly available WMO data were used to estimate current MAT at Chuifeng Cave. Correlation between online 
altitude estimation and WMO provided altitude is R² = 0.99 (Pearson correlation). 
Vision, choice, action and behavioural engagement arise from neuronal activity that 
may be distributed across brain regions. Here we delineate the spatial distribution of 
neurons underlying these processes. We used Neuropixels probes’” to record from 
approximately 30,000 neurons in 42 brain regions of mice performing a visual 
discrimination task’. Neurons in nearly all regions responded non-specifically when 
the mouse initiated an action. By contrast, neurons encoding visual stimuli and 
upcoming choices occupied restricted regions in the neocortex, basal ganglia and 
midbrain. Choice signals were rare and emerged with indistinguishable timing across 
regions. Midbrain neurons were activated before contralateral choices and were 
suppressed before ipsilateral choices, whereas forebrain neurons could prefer either 
side. Brain-wide pre-stimulus activity predicted engagement in individual trials and in 
the overall task, with enhanced subcortical but suppressed neocortical activity during 


engagement. These results reveal organizing principles for the distribution of 
neurons encoding behaviourally relevant variables across the mouse brain. 


Performing a perceptual decision involves processing sensory infor- 
mation, selecting actions that may lead to reward, and executing these 
actions. It remains unknown how the neurons mediating these pro- 
cesses are distributed across brain regions, and whether they rely on 
circuits that are shared or distinct. Most studies of action selection 
(hereafter referred to simply as choice) have focused on individual 
regions, such as frontal, parietal and motor cortex, basal ganglia, thala- 
mus, cerebellum and superior colliculus. However, neural correlates 
of movements, rewards, and other task variables have been observed 
in multiple brain regions, including in areas previously identified as 
purely sensory. It is therefore possible that many brain regions 
also participate in action selection. Nevertheless, neuronal signals that 
correlate with action do not necessarily correlate with choice. To carry 
choice-related signals, a brain region must contain neurons whose fir- 
ing selectively predicts the chosen action before the action occurs. 
Successful performance in a perceptual task depends not only on 
choosing the correct action, but also on choosing to engage in the task 
in the first place. Stimuli that drive actions during an engaged behav- 
ioural state do not necessarily drive actions when disengaged; for exam- 
ple, in contexts where the action will not lead to reward. Furthermore, 
even well-trained participants often show varying levels of behavioural 
engagement or vigilance within a task, resulting in varying probability 
of responding promptly and accurately to sensory stimuli. At times 
of low engagement, stimuli arriving at the sense organs evidently fail 
to effectively drive the circuits responsible for selecting and initiating 
action. It remains unclear whether this context-dependent gating occurs 
globally or whether it involves multiple brain systems differentially. 


Brain-wide recording in visual behaviour 


To determine the distribution of neurons encoding vision, choice, 
action, and behavioural engagement, we recorded neural activity across 
the brain while mice performed a task that enabled us to distinguish 
these processes (Fig. 1a–c). This task combines the advantages of two- 
alternative forced choice and Go-NoGo designs. On each trial, visual 
stimuli of varying contrast could appear on the left side, right side, both 
sides or neither side. Mice earned a water reward by turning a wheel with 
their forepaws to indicate which side had the higher contrast (Fig. 1a–c). If 
neither stimulus was present, they earned a reward for making a third 
type of response: keeping the wheel still for 1.5 s. If left and right stimuli 
had equal non-zero contrast, the mice were rewarded randomly for left 
or right turns. The same visual stimulus could therefore lead to either 
direction of turn, or to no action, enabling us to dissociate the neural 
correlates of visual processing, of action initiation (turning the wheel 
versus holding it still) and of action selection (turning left versus right). 
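
The reward contingencies described above can be summarized as a small decision rule. The following is a minimal sketch, not the authors' task code: the function name, the response labels and the use of Python's `random` module for the equal-contrast case are assumptions made for illustration.

```python
import random


def rewarded_response(left_contrast, right_contrast, rng=random):
    """Return which response earns a reward on one trial.

    Labels are hypothetical: 'left' / 'right' mean choosing the stimulus
    on that side, 'nogo' means holding the wheel still for 1.5 s.
    """
    if left_contrast == 0 and right_contrast == 0:
        return "nogo"  # no stimulus on either side
    if left_contrast > right_contrast:
        return "left"
    if right_contrast > left_contrast:
        return "right"
    # Equal, non-zero contrasts: left or right is rewarded at random.
    return rng.choice(["left", "right"])
```

Because the same stimulus pair can be followed by a left turn, a right turn or no turn, trials can be grouped by stimulus, by whether any turn occurred, and by turn direction independently.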

Mice performed the task proficiently (Fig. 1c, Extended Data Fig. 1). 
Their choices were most accurate when stimuli appeared on a single 
side at high contrast (1.7 ± 2.5% incorrect choices, defined as turns in 
the wrong direction; 10.1 ± 8.3% Misses, defined as failures to turn; 
mean ± s.d., n = 39 sessions, 10 mice). They performed less accurately in 
more challenging conditions: with low-contrast single stimuli (5.1 ± 6.4% 
incorrect choices; 29.8 ± 19.8% Misses); or with competing stimuli of 
similar but unequal contrast (20.0 ± 7.9% incorrect choices; 13.9 ± 11.7% 
Misses, on trials with high versus medium or medium versus low con- 
trast). As expected in these more challenging cases, reaction times 
were longer (Fig. 1d; P< 10“, multi-way ANOVA). 

While mice performed the task, we used Neuropixels probes to record 
from approximately 30,000 neurons in 42 brain regions (Fig. 1e–j; abbre- 
viations of brain regions are defined in Extended Data Table 1). Inserting 
two or three probes at a time in the left hemisphere yielded simultaneous 
recordings from hundreds of neurons in multiple regions during each 
recording session (n = 92 probe insertions over 39 sessions in 10 mice, 
Fig. 1h, i). We identified the firing times of individual neurons using 
Kilosort and phy, and determined their anatomical locations by combining 


Fig. 1 | Brain-wide recordings during a task that distinguishes vision, choice 
and action. a, Mice earned water rewards by turning a wheel to indicate which of 
two visual gratings had higher contrast, or by not turning if no stimulus was 
presented. When stimuli had equal contrast, a left or right choice was rewarded 
with 50% probability. Grey rectangles indicate the three computer screens 
surrounding the mouse. Arrows (not visible to the mouse) indicate the rewarded 
wheel turn direction and the coupled movement of the visual stimulus (X 
indicates reward for no turn), and the coloured dashed circle (not visible to the 
mouse) indicates the stimulus location at which a reward was delivered. 
b, Timeline of the task. Mice were free to move as soon as the stimulus appeared, 
but the stimulus was fixed in place and rewards were unavailable until after an 
auditory tone cue. If no movement was made for 1.5 s after the tone cue, a NoGo 
was registered. The grey region is the analysis window, from 0 to 0.4 s after 
stimulus onset. CW, clockwise; ACW, anticlockwise. c, Average task 
performance across subjects; n = 10 mice, 39 sessions, 9,538 trials. Colour maps 
depict the probability of each choice given the combination of contrasts 
presented. d, Reaction time as a function of stimulus contrast and presence of 


electrophysiological features with histological reconstruction of fluores- 
cently labelled probe tracks (Fig. 1g, Extended Data Figs. 2, 3). Across all ses- 
sions, we recorded from 29,134 neurons (n = 747 ± 38 neurons per session, 
mean ± s.e.), of which 22,458 were localizable to one of 42 brain regions. 


Propagation of activity during the task 


Trial onset was followed by increased average activity in nearly all 
recorded regions. A sizeable fraction of all neurons (60.0%, 13,466 


competing stimuli. e, Mice were head-fixed with forepaws on the wheel while 
multiple Neuropixels probes were inserted for each recording. f, Frontal view of 
subject performing the behavioural task during recording, with forepaws on 
wheel and lick spout for acquiring rewards. g, Example electrode track histology 
with atlas alignment overlaid. h, Recording track locations as registered to the 
Allen Common Coordinate Framework 3D space. Each coloured line represents 
the span recorded by a single probe on a single session, coloured by mouse 
identity. D, dorsal; A, anterior; L, left. i, Summary of recording locations. 
Recordings were made from each of the 42 brain regions coloured on the top- 
down view of cortex (left) and sagittal section (right). For each region, the 
number in parentheses indicates total recorded neurons. For abbreviations, see 
Extended Data Table 1. j, Spike raster from an example individual trial, in which 
populations of neurons were simultaneously recorded across visual and frontal 
cortical areas, hippocampus and thalamus. Brain diagrams were derived from 
the Allen Mouse Brain Common Coordinate Framework (version 3 (2017); 
downloaded from http://download.alleninstitute.org/informatics-archive/current-release/mouse_ccf/). 


neurons) showed significant modulation of firing rate during the 
task (P < 0.05; Bonferroni-corrected test for six task epochs; Supple- 
mentary Figs. 1, 2; Methods). Most of these neurons (74.3%) consist- 
ently increased their activity during the task, but a sizeable minority 
(20.2%) consistently decreased their activity. Neurons were diverse in 
the timing of their activity during trials, with timing differences both 
within and between brain areas (Extended Data Fig. 4a, b); however, 
neuronal activity was detectable before the onset of wheel movement 
in most regions (Extended Data Fig. 4c, d). Similarly widespread 


Fig. 2 | Activity propagates from a visual pathway to the entire brain during 
task performance. a–c, Rasters showing activity of three example neurons 
following visual stimuli presented on the contralateral or ipsilateral side alone, 
on correct choice trials (when they evoked wheel turns in the correct direction), 
Miss trials (when mice failed to respond in the task context) and when stimuli 
were presented in a passive context with no opportunity to earn reward. Top six 
panels: aligned to stimulus onset, black dots represent movement. Bottom two 


activity was observed following reward delivery (Extended Data 
Fig. 4e). 

Examining rasters of individual neurons’ activity across trials 
revealed consistent correlates of action initiation, sensory stimuli or 
choices (Fig. 2a–c). For example, a neuron in the subiculum (Fig. 2a) 
produced no response to the visual stimuli, but consistently fired 
before wheel turns regardless of their direction. Such non-specific 
movement correlates were also often seen in neurons that produced 
sensory responses. For example, a neuron in visual cortex (Fig. 2b) 
showed activity following onset of visual stimulus that was selective 
for stimulus location but also fired following wheel turns, regardless 
of the subject’s choice (that is, direction of wheel turn). Neurons with 
choice-selective responses were rare but could be found in select nuclei: 
for example, a neuron in the zona incerta (ZI; Fig. 2c) showed no visual 
response but increased its firing rate before contralateral choices, with 
no response before or after ipsilateral choices. 

Throughout the brain, most of the activity following trial onset 
reflected non-specific movement correlates (Fig. 2d–h). When a mouse 
successfully selected a visual stimulus contralateral to the recorded 
hemisphere, activity emerged first in classical visual regions such as 
visual cortex (VIS) and superficial superior colliculus (SCs), and soon 
spread to most of the remaining recorded regions (Fig. 2d). When 
the mouse successfully selected an ipsilateral stimulus, most areas 


panels: aligned to movement, black dots represent stimulus onset. d–h, Colour 
maps showing firing rates averaged over responsive neurons in each region 
and over trials of the indicated type. Contralateral visual stimulus contrasts 
were matched between d, f and h so that differences in activity do not reflect 
differing visual drive. Subpanels to the right of each colour map represent the 
percentage of neurons in each area significantly more responsive during that 
condition than baseline (P< 10~*; see Methods). 


were again activated, but VIS and SCs were now among the last areas 
to respond, rather than the first (Fig. 2e). When the mouse missed a 
contralateral stimulus, leading to no action, activity was found in a 
‘visual pathway’ consisting of classical visual areas, basal ganglia and 
several midbrain structures (Fig. 2f), but failed to propagate globally. 
When the mouse missed an ipsilateral stimulus, however, the recorded 
hemisphere remained largely silent (Fig. 2g). The widely distributed 
activity seen following trial onset was therefore present only when mice 
moved, regardless of the particular stimulus and particular action. 

Outside of the task context, responses to visual stimuli were similar 
to those of Miss trials, but generally weaker (Fig. 2h). We measured 
activity in passive replay periods following task performance, when 
the same stimuli were presented without the opportunity to earn 
rewards. In these passive trials, the mice hardly ever turned the wheel 
(94.1% ± 0.6% of trials with high contrast stimuli had no movement). 
Stimuli contralateral to the recorded hemisphere gave rise to weak 
activity restricted to the visual pathway (Fig. 2h). No activity was seen 
on average following passive presentation of stimuli ipsilateral to the 
recorded hemisphere (not shown). 

In sum, this analysis of average activity suggests that whereas 
responses to visual stimuli are largely confined to a restricted visual 
pathway, neural correlates of action initiation are essentially global. To 
assess the distribution of these signals at a finer scale and to search for 
signals encoding choice, we next examined the activity of individual 
neurons. 


Globally distributed action coding 


To analyse the firing correlates of individual neurons, we used an 
approach based on kernel fitting (Extended Data Fig. 5). We fit the 
activity of each neuron with a sum of kernel functions time-locked 
to stimulus presentation and to movement onset. We fit six stimu- 
lus-locked kernels—one for each of three possible contrast values on 
each side (‘Vision’ kernels), which captured variations in amplitude 
and timing of the visual activity driven by different stimuli. We fit two 
movement-locked kernels: an ‘Action’ kernel triggered by a movement 
in either direction and a ‘Choice’ kernel capturing differences in activity 
between left and right movements. 
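
To make the kernel model concrete, the sketch below builds a predicted firing-rate trace for one trial by summing event-locked kernels. It is an illustration under stated assumptions rather than the fitting procedure of the paper: the kernel shapes, the bin indices and the use of a signed weight (+1 or -1) to apply the Choice kernel to the two movement directions are all hypothetical.

```python
def predict_rate(n_bins, events, kernels, baseline=0.0):
    """Sum event-locked kernels into a predicted firing-rate trace.

    events:  list of (kernel_name, start_bin, weight) tuples for one trial;
             start_bin is the first time bin that the kernel covers.
    kernels: dict mapping kernel_name to a list of rates, one per time bin.
    """
    rate = [baseline] * n_bins
    for name, start_bin, weight in events:
        for lag, value in enumerate(kernels[name]):
            t = start_bin + lag
            if 0 <= t < n_bins:  # ignore kernel samples outside the trial
                rate[t] += weight * value
    return rate


# Hypothetical kernels (arbitrary units) and one hypothetical trial.
example_kernels = {
    "vision_contra_high": [2.0, 4.0, 1.0],
    "action": [0.0, 3.0, 3.0],
    "choice": [1.0, 1.0, 0.0],
}
example_events = [
    ("vision_contra_high", 1, 1.0),  # stimulus-locked
    ("action", 3, 1.0),              # movement-locked, any direction
    ("choice", 3, -1.0),             # signed by movement direction (assumption)
]
predicted = predict_rate(6, example_events, example_kernels)
```

Fitting then amounts to choosing the kernel values that bring such predictions closest to the recorded activity across trials.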

To determine which neurons encoded vision, action and choice, we 
used a nested test: we fit a model including all kernels except the one to 
be tested and determined whether adding the test kernel improved the 
model's prediction of held-out data. Applying this test to the example 
neurons from before, we find that this method succeeded in quantifying 
the contralateral visual stimulus (Fig. 3a) and action (Fig. 3b) correlates 
inferred from examining each trial type (compare with Fig. 2a, b). In 
determining whether a neuron passed this test, we used parameters 
that gave false-positive error rates of 0.33% on shuffled data (Extended 
Data Fig. 5h). As our question concerns activity predictive of upcoming 
movements, we applied this analysis only to pre-movement activity. 
Consistent with its raster plot (Fig. 2a), the example subicular neuron 
examined earlier required only an Action kernel, indicating entirely 
non-selective action correlates (Fig. 3b). By contrast, the example 
visual cortical neuron (Fig. 2b) required only Vision kernels (Fig. 3a), 
indicating that it had exclusively visual correlates before action initia- 
tion. The fraction of cross-validated variance explained by the kernels 
was frequently small (Fig. 3c, Extended Data Fig. 6a), even for neurons 
whose mean rates they accurately predicted (50.2% for the neuron in 
Fig. 3a and 13.6% for the neuron in Fig. 3b), as expected from trial-to-trial 
variability and encoding of task-independent variables. 
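
The logic of the nested test can be sketched with a deliberately reduced example: a model without the tested term is compared with a model that includes it, and the term counts as required only if it lowers the error on held-out trials. Everything specific here is an assumption for illustration: a single scalar regressor stands in for a kernel, folds are assigned by trial index, and `min_gain` is a hypothetical threshold rather than the criterion used in the paper.

```python
def held_out_error(x, y, use_regressor, n_folds=5):
    """Mean squared error on held-out trials under k-fold cross-validation."""
    n = len(y)
    total = 0.0
    for fold in range(n_folds):
        train = [i for i in range(n) if i % n_folds != fold]
        test = [i for i in range(n) if i % n_folds == fold]
        mean_y = sum(y[i] for i in train) / len(train)
        mean_x, slope = 0.0, 0.0
        if use_regressor:
            # Least-squares slope estimated on training trials only.
            mean_x = sum(x[i] for i in train) / len(train)
            sxx = sum((x[i] - mean_x) ** 2 for i in train)
            sxy = sum((x[i] - mean_x) * (y[i] - mean_y) for i in train)
            slope = sxy / sxx if sxx > 0 else 0.0
        for i in test:
            prediction = mean_y + slope * (x[i] - mean_x)
            total += (y[i] - prediction) ** 2
    return total / n


def regressor_is_required(x, y, n_folds=5, min_gain=0.0):
    """True if adding the regressor improves held-out prediction."""
    reduced = held_out_error(x, y, False, n_folds)
    full = held_out_error(x, y, True, n_folds)
    return (reduced - full) > min_gain
```

Evaluating on held-out trials matters because an extra term always fits the training data at least as well, so only out-of-sample improvement is evidence that the neuron encodes the variable.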

Neurons encoding vision (that is, requiring Vision kernels) were 
found in a pathway comprising primarily classical visual areas (Fig. 3d, 
f). They were common in VIS, thalamus and superficial superior col- 
liculus (SCs), but were also occasionally present in other structures 
such as frontal cortex (MOs, ACA and PL), basal ganglia (CP, GPe and 
SNr), and several midbrain nuclei (SCm, MRN, APN and ZI; Fig. 3d, f, 
Extended Data Fig. 6a; brain regions are listed in Extended Data Table 1). 

By contrast, neurons encoding action (requiring an Action kernel) 
were spread throughout all recorded regions (Fig. 3e, g). The distribu- 
tion of these neurons encoding action was significantly broader than 
that of neurons encoding visual stimuli (Extended Data Fig. 4f). A large 
majority of neurons encoding action did not require an additional 
Choice kernel—they responded equally for movements in either direc- 
tion. The rare exceptions requiring a Choice kernel are discussed next. 


Choice coding in forebrain and midbrain 


Neurons encoding specific choices were found in a small subset of brain 
regions (Fig. 4a, b). We identified choice-selective neurons as neurons 
for which the Choice kernel was required to explain their activity in 
the nested test described above. These neurons were rare and were 
distributed across frontal cortex (MOs, PL and MOp), basal ganglia (CP 
and SNr), higher-order thalamus and motor-related superior colliculus 
(SCm), as well as in two unexpected subcortical nuclei, the midbrain 
reticular nucleus and ZI (Fig. 4b). This set of regions encoding choice 
overlapped partially with the visual pathway—both included frontal cor- 
tex, basal ganglia and several midbrain structures, but choice-selective 
neurons were not found in VISp. Neurons encoding choice were again 
significantly more localized than neurons encoding action (Extended 
Data Fig. 4f). To further confirm these conclusions, we developed a 
version of choice-probability analysis for tasks with many stimulus 
conditions, called ‘combined-conditions choice probability’ (Methods). 
This statistic quantifies the probability that a neuron’s spike count 
will be greater on trials with one choice than with other choices, for 
matched stimulus conditions as in classical choice probability. This 
analysis produced similar results (Extended Data Fig. 7). 
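
One way to read this statistic is as a pooled pairwise comparison restricted to trials with matched stimuli. The sketch below is an assumption about how conditions might be combined, not the definition given in the Methods: within each stimulus condition every trial with one choice is compared with every trial with the other choice, ties count as half, and the comparisons are pooled across conditions.

```python
def combined_conditions_cp(trials):
    """Pooled choice probability across stimulus conditions.

    trials: list of (condition, choice, spike_count) tuples, where choice
            is 'A' or 'B' (hypothetical labels for the two choices).
    Returns the fraction of within-condition (A, B) trial pairs in which
    the A trial had the larger spike count, or None if no pairs exist.
    """
    by_condition = {}
    for condition, choice, count in trials:
        groups = by_condition.setdefault(condition, {"A": [], "B": []})
        groups[choice].append(count)

    wins = 0.0
    pairs = 0
    for groups in by_condition.values():
        for a in groups["A"]:
            for b in groups["B"]:
                pairs += 1
                if a > b:
                    wins += 1.0
                elif a == b:
                    wins += 0.5  # ties split evenly
    return wins / pairs if pairs else None
```

A value near 0.5 indicates no choice preference once the stimulus is accounted for, and conditions in which only one choice occurred contribute no pairs.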

Across choice-encoding regions, choice signals emerged with similar 
timing (Fig. 4c, d). We trained a decoder to predict the subject’s choice 
from recorded population activity after first subtracting the prediction 
of population activity from the Vision and Action kernels (to yield a 
decoding of choice isolated from visual and non-specific action sig- 
nals). This population-level decoding identified similar areas encoding 
each variable as did the individual neuron decoding (Extended Data 
Fig. 5g), and we found that the time course of choice decoding was 
not significantly different across choice-selective regions in frontal 
cortex, striatum and midbrain (two-way ANOVA on brain region and 
time, interaction P > 0.05; Fig. 4c). We validated this conclusion using 
joint peri-event canonical correlation (jPECC) analysis, an extension of 
the joint peristimulus time histogram method, modified to detect 
correlations in a ‘communication subspace’ between two populations. 
Whereas jPECC analysis revealed a consistent time lag for activity cor- 
relations between visual and frontal cortex (and between visual cortex 
and midbrain choice areas), it revealed no lag for activity correlations 
between frontal and midbrain areas (Fig. 4d, Extended Data Fig. 8). 

Although the encoding of choice emerged with indistinguishable 
timing in midbrain and forebrain, these regions encoded choice dif- 
ferently (Fig. 4e-h). In midbrain (MRN, SCm, SNr and ZI), nearly all 
choice-selective neurons (53 out of 54, 98%) preferred contralateral 
choices (Fig. 4e, top and Fig. 4f). By contrast, choice-selective neurons 
in forebrain (MOs, PL, MOp and CP) could prefer either choice, with a 
sizeable proportion preferring ipsilateral choices (19 out of 48, 40%), 
significantly more than in the midbrain (P< 10°, Fisher’s exact test; 
Fig. 4e, bottom and Fig. 4g). Moreover, many midbrain choice-selec- 
tive neurons exhibited directionally opposed activity: their activity 
increased before one choice and decreased below baseline before the 
other (29 out of 54, 54%; note points to the left of x= 0 in Fig. 4f). By 
contrast, neurons in the forebrain typically increased firing before both 
left and right choices (10 out of 48, 21% suppressed for non-preferred 
choice, significantly less than in the midbrain; P< 10°, Fisher’s exact 
test; Fig. 4g). Neurons encoding choice, therefore, exhibit a distinctive 
bilateral encoding of both choices in the forebrain, versus a unilateral 
encoding of contralateral choices in the midbrain (Fig. 4h). 
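
The two comparisons above are 2 × 2 contingency tables built from the counts in the text, so they can be checked with a direct implementation of Fisher's exact test. This is a minimal sketch using only the standard library; the two-sided rule used here (summing all tables no more probable than the observed one) is a common convention and is an assumption about how the reported test was run.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(k):
        # Hypergeometric probability of k counts in the top-left cell.
        return comb(row1, k) * comb(row2, col1 - k) / denom

    p_observed = prob(a)
    k_min = max(0, col1 - row2)
    k_max = min(row1, col1)
    return sum(
        p for p in (prob(k) for k in range(k_min, k_max + 1))
        if p <= p_observed * (1 + 1e-9)  # tolerance for float ties
    )


# Ipsilateral-preferring versus contralateral-preferring neurons:
# midbrain 1 versus 53, forebrain 19 versus 29 (counts from the text).
p_preference = fisher_exact_two_sided(1, 53, 19, 29)

# Suppressed for the non-preferred choice versus not suppressed:
# midbrain 29 versus 25, forebrain 10 versus 38 (counts from the text).
p_suppression = fisher_exact_two_sided(29, 25, 10, 38)
```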


Distributed coding of task engagement 


We next investigated whether engagement in a trial or in the overall task 
corresponded to characteristic patterns of brain activity. We reasoned 
that Go trials (when the mouse made a left or right choice), Miss trials 
and passive visual responses (measured outside the task) might repre- 
sent three points along a continuum corresponding to progressively 
lower levels of task engagement. 

We began by comparing two conditions in which visual stimuli and 
behavioural reports were identical: passive visual responses measured 
outside the task, and responses on Miss trials during the task (Fig. 5a-c). 
Even though the two conditions were matched for visual stimulation 
and (lack of) action, they were accompanied in many areas by different 
activity, both before and after stimulus presentation (Fig. 5a, Extended 
Data Fig. 6b). Consistent with the average firing rates presented earlier 
(Fig. 2f, h), more neurons were significantly activated by visual stimuli 
during the task (Miss trials), than during passive stimulation (Fig. 5b). 
Differences were also seen in pre-stimulus firing rates; for instance, 
pre-stimulus activity in VISp was lower in the task (Miss trials) than 
during passive stimulation, whereas pre-stimulus activity in CP showed 
the opposite modulation (Fig. 5a). Whereas neocortex and sensory 


Fig. 3 | Neurons encoding vision are localized but neurons encoding action 
are found globally. a, Example of regression analysis for the example VISpm 
neuron shown in Fig. 2b. Firing rate was averaged (solid thin line, mean; shaded 
regions, s.e.m. across trials) across the trial types indicated: all trials with 
contralateral or ipsilateral stimuli (brown or blue; left panels), and all trials with 
contralateral or ipsilateral choices (orange or blue, right panels). Top graphs 
show mean firing rate (thin lines and shading) overlaid with cross-validated 
prediction of the regression model using all kernels (solid thick lines). Bottom 
graphs show mean rate (thin lines and shading) overlaid with fits excluding the 
indicated kernel (dashed lines). The good fit of the full model is lost when 
excluding the contralateral Vision kernels, indicating that this neuron has 
stimulus-locked activity that cannot be explained by other variables. b, Similar 


thalamus showed a net decrease in pre-stimulus activity during task 
context, other regions—including basal ganglia and other subcortical 
choice-encoding areas—showed a consistent increase (Fig. 5c, Extended 
Data Fig. 9a). This effect extended to neurons that were not otherwise 
responsive during the task (Supplementary Fig. 3). 


Brain diagrams were derived from the Allen Mouse Brain Common Coordinate Framework (version 3 (2017); 
downloaded from http://download.alleninstitute.org/informatics-archive/current-release/mouse_ccf/). 


Consistent with the hypothesis of a continuum of engagement across 
passive, Miss and Go trials, the pattern of firing accompanying task 
engagement predicted successful performance on individual trials 
(Fig. 5d-f). For this analysis, we examined only pre-stimulus activity, 
which could not be conflated with the large non-specific responses 


Fig. 4 | Choice signals emerge simultaneously across a localized set of 
forebrain and midbrain areas. a, Firing rates of four example neurons, 
averaged across indicated trial types (shaded regions, s.e.m. across trials), 
cross-validated fits of the kernel model using all kernels (solid lines), and fits 
using all kernels except Choice (dashed lines). The VISp neuron can be 
accurately predicted without the Choice kernel, indicating that the differing 
responses between left and right choices can be explained by visual responses. 
The other three neurons cannot be predicted without the Choice kernel. The ZI 
neuron also appeared in Fig. 2c. b, Fraction of neurons in each brain region for 
which accurate prediction required the Choice kernel (false-positive rate on 
shuffled data, 0.3%). Empty bars indicate areas for which the number of 
neurons passing analysis criteria was less than 5. c, Time courses of population 
decoding of choice from frontal cortex (MOs, MOp and PL), striatum (CP), and 
midbrain (MRN, SCm, ZI and SNr) did not significantly differ (P > 0.05, two-way 
ANOVA). Shaded regions, s.e.m. across recordings. d, Left, jPECC analysis 


related to movements. Regions that showed differences in pre-stimulus 
firing rate between task and passive contexts also showed similar dif- 
ferences between Go and Miss trials (Fig. 5d). Indeed, it was possible to 
predict whether a mouse would respond to the stimulus on a given trial 
from the ‘engagement index’, a projection of pre-stimulus population 
activity onto the axis defined by the difference of pre-stimulus activity 
in passive and task contexts (Fig. 5e). This engagement index consist- 
ently differed between Go and Miss trials across recordings (Fig. 5f; 
paired t-test, P< 10), an effect that could not be fully explained by 
variability in pupil diameter, in overt movements detectable by video 
recordings, in the presence of a reward on the previous trial, or in the 


shows that population activity in visual cortex predicts that in frontal cortex 
following a lag of about 40 ms, but only in the period about 200 ms before 
movement. Right, population activity in midbrain and frontal cortex do not 
show a consistent lead-lag relationship. e, Trial-averaged firing rates of 
example neurons recorded in the midbrain (top row) and forebrain (bottom 
row) aligned to contralateral (orange) and ipsilateral choices (blue). f, g, Scatter 
plot of activity of individual midbrain and forebrain neurons at movement 
onset relative to baseline activity, for trials with contralateral versus ipsilateral 
choices (estimated from the kernel model). Darker points represent neurons 
with significant choice encoding. h, Summary of f and g on a brain map. Red and 
tan indicate regions containing neurons of unilateral or bilateral selectivity. 
Brain diagrams were derived from the Allen Mouse Brain Common Coordinate 
Framework (version 3 (2017); downloaded from 
http://download.alleninstitute.org/informatics-archive/current-release/mouse_ccf/). 


inter-trial interval (Extended Data Fig. 9b-k). This index is therefore 
distinguishable from correlates of movement, reward and arousal, 
and represents a specific brain-wide neural signature of engagement. 
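
The engagement index is a projection of a population vector onto a fixed axis, which can be written in a few lines. The sketch below follows the description in the text and in the Fig. 5e legend (axis defined by each neuron's pre-stimulus rate difference between task and passive contexts); the function names and the three-neuron example rates are hypothetical, and no normalization or z-scoring is applied here.

```python
def engagement_axis(task_rates, passive_rates):
    """Per-neuron difference in mean pre-stimulus rate, task minus passive."""
    return [t - p for t, p in zip(task_rates, passive_rates)]


def engagement_index(pre_stimulus_rates, axis):
    """Project one trial's pre-stimulus population activity onto the axis."""
    return sum(r * a for r, a in zip(pre_stimulus_rates, axis))


# Hypothetical three-neuron population (rates in spikes per second).
axis = engagement_axis(task_rates=[2.0, 6.0, 4.0],
                       passive_rates=[4.0, 3.0, 4.0])
index_trial_1 = engagement_index([1.0, 7.0, 5.0], axis)  # task-like pattern
index_trial_2 = engagement_index([4.0, 3.0, 9.0], axis)  # passive-like pattern
```

Neurons whose rate does not differ between contexts receive zero weight, so the index is driven only by neurons that distinguish task from passive periods.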


Discussion 

Using brain-wide recordings of neuronal populations, we revealed 
organizing principles of the distribution of neurons carrying distinct 
correlates of a visual choice behaviour. Neurons non-selectively encod- 
ing action are widely distributed. Neurons encoding choice are rarer, 
are less widely distributed, and exhibit unilateral encoding in midbrain 


Fig. 5 | Task engagement correlates differently with cortical and subcortical 
activity. a, Comparison of population average spiking activity for several brain 
regions, for task-context trials when contralateral stimuli were presented but 
mice did not respond (that is, Miss trials, blue) and for stimulus presentations 
during the passive context (grey). Visual stimulus contrasts were matched 
between the two conditions. b, Excess fraction of neurons significantly 
responsive on Miss trials compared to passive condition, for matched contrast 
stimuli. c, Brain map showing difference in pre-stimulus firing rate between task 
and passive conditions, averaged over all neurons in each region. d, Scatter plot 
showing difference in pre-stimulus rate between task and passive contexts 
(x axis) and between Go and Miss trials within the task context (y axis), averaged 
over all neurons in a region. Text size indicates number of neurons in the analysis 


and bilateral encoding in forebrain regions. Correlates of engagement 
are characterized by enhanced subcortical activity and suppressed 
neocortical activity. 

Neurons with action correlates are found globally: neurons in nearly 
every brain region were non-selectively activated in the moments 
leading up to movement onset. This global representation of action 
is consistent with reports of widespread action correlates in multiple 
species and suggests that non-specific action correlates may in 
fact be ubiquitous in the mouse brain, cortically and subcortically. These 
signals may comprise forms of corollary discharge, but cannot reflect 
sensory re-afference as they were observed before movement onset. 
Global non-selective action correlates may underlie brain-wide task- 
related activity observed in rodents and humans. This ubiquitous 
presence of non-selective action correlates underscores the advantage 
of multi-alternative tasks for revealing the neural correlates of behav- 
ioural choice: Go-NoGo tasks cannot distinguish neurons that fire non- 
specifically for any action from neurons selective for specific choices. 

The set of regions encoding choice is spatially restricted and is 
characterized by qualitatively distinct midbrain and forebrain com- 
ponents. It includes many of the regions classically implicated in choice 
behaviour including frontal cortex, striatum, substantia nigra pars 
reticulata and the deep layers of the superior colliculus, but also 


(range 130-1,534). e, On each trial, an engagement index is computed by 
projecting pre-stimulus population activity onto a vector defined by each 
neuron’s rate difference between task and passive contexts. The graph shows 
probability of Go response as a function of z-scored engagement index. Red, 
movement probability for each bin of engagement index (error bars indicate 
s.e.m. across trials). Black, logistic regression fit with 95% confidence bands 
(grey). f, Histogram of differences in pre-stimulus engagement index for Go 
versus Miss trials for each recording. Inverted triangle represents the mean 
value across recordings (mean = 8.42 AU). AU, arbitrary units. Brain diagrams 
were derived from the Allen Mouse Brain Common Coordinate Framework 
(version 3 (2017); downloaded from 
http://download.alleninstitute.org/informatics-archive/current-release/mouse_ccf/). 


unexpected regions including the midbrain reticular nucleus and 
zona incerta. These regions also contained neurons encoding visual 
stimuli, even during passive stimulus presentation; whether visual 
neurons would be found in these ‘motor’ areas in untrained animals is 
not clear from these data. Our analyses revealed a striking anatomical 
organizing principle: choice neurons in the forebrain (neocortex and 
striatum) are enhanced before both contra- and ipsilateral choices 
and can prefer either, but choice neurons in the midbrain are almost 
exclusively enhanced for contralateral choices and often suppressed 
for ipsilateral choices. Despite this distinct encoding, we could not 
distinguish the timing of choice-related signals between these regions, 
an observation parsimoniously explained by a recurrent loop across 
them (Supplementary Discussion; Supplementary Fig. 4a). 
Brain-wide correlates of engagement are characterized by enhanced 
subcortical activity and suppressed neocortical activity before onset of 
visual stimulus. Engagement-related cortical suppression might seem 
surprising given that visual cortex is required for performance of this 
task. However, increased arousal has been associated with reduced 
spiking activity and hyperpolarization in multiple cortical areas, an 
effect that may improve signal-to-noise ratios of sensory representa- 
tions. Enhanced activity in subcortical areas, by contrast, brings activ- 
ity in these regions closer to the level at which actions are initiated, 
providing a potential mechanism for increased probability of action in 
engaged states (Supplementary Discussion; Supplementary Fig. 4b-d). 

In summary, we have identified organizing principles for the distri- 
bution and character of the neuronal correlates of a lateralized visual 
discrimination task across the mouse brain. Future work will be required 
to determine the circuit mechanisms that enforce these principles, how 
they extend to areas such as cerebellum and brainstem omitted from 
the current survey, and the degree to which similar principles govern 
the neural correlates of different choice tasks. 
Methods


Experimental procedures were conducted according to the UK Ani- 
mals Scientific Procedures Act (1986) and under personal and project 
licenses released by the Home Office following appropriate ethics 
review. 


Mice 

Experiments were performed on 10 male and female mice, between
11 and 46 weeks of age (Supplementary Table 1). Multiple genotypes
were used, including: Ai95;Vglut1-Cre
(B6J.Cg-Gt(ROSA)26Sortm95.1(CAG-GCaMP6f)Hze/MwarJ crossed with
B6;129S-Slc17a7tm1.1(cre)Hze/J), TetO-G6s;Camk2a-tTa
(B6;DBA-Tg(tetO-GCaMP6s)2Niell/J crossed with
B6.Cg-Tg(Camk2a-tTA)1Mmay/DboJ), Snap25-G6s (B6.Cg-Snap25tm3.1Hze/J),
Vglut1-Cre, and wild-type (C57Bl6/J). None of these lines are known to
exhibit aberrant epileptiform activity. Of 13 mice initially trained for
inclusion in this study, three developed health complications before
training completed and were not recorded. The other 10 successfully
learned the task (see criteria below) and were included. The sample
sizes (n = 10 mice; n = 39 recording sessions; n = 29,134 neurons) were
not determined with a power analysis.


Surgery 
A brief (around 1 h) initial surgery was performed under isoflurane
(1-3% in O2) anaesthesia to implant a steel headplate (approximately
15 × 3 × 0.5 mm, 1 g) and, in most cases, a 3D-printed recording chamber.
The chamber was a semi-conical, opaque piece of polylactic acid with
a 12 mm diameter upper surface and a lower surface designed to fit to
the shape of an average mouse skull, exposing approximately the area
from 3.5 mm anterior to 5.5 mm posterior to bregma, and 4.5 mm left to
4.5 mm right, and narrowing near the eyes. The implantation method
largely followed the method of Guo et al. with some modifications and
was previously described. In brief, the dorsal surface of the skull was
cleared of skin and periosteum and prepared with a brief application 
of green activator (Super-Bond C&B, Sun Medical). The chamber was 
attached to the skull with cyanoacrylate (VetBond; World Precision 
Instruments) and the gaps between the cone and the skull were filled 
with L-type radiopaque polymer (Super-Bond C&B). A thin layer of 
cyanoacrylate was applied to the skull inside the cone and allowed to 
dry. Thin layers of UV-curing optical glue (Norland Optical Adhesives 
#81, Norland Products) were applied inside the cone and cured until 
the exposed skull was covered. The head plate was attached to the 
skull over the interparietal bone with Super-Bond polymer, and more 
polymer was applied around the headplate and cone. 

After surgery, mice were given three days to recover while being
treated with carprofen, then acclimated to handling and head-fixation 
before training. 


Two-alternative unforced choice task 

The two-alternative unforced choice task design was described
previously. Mice sat on a plastic apparatus with forepaws on a rotating
wheel surrounded by three computer screens (Adafruit, LP097QX1)
at right angles covering 270 × 70 degrees of visual angle (d.v.a.). Each
screen was roughly 11 cm from the mouse’s eyes at its nearest point and
refreshed at 60 Hz. The screens were fitted with Fresnel lenses (Wuxi
Bohai Optics, BHPA220-2-5) to ameliorate reductions in luminance and
contrast at larger viewing angles near their edges, and these lenses
were coated with scattering window film (‘frostbite’, The Window
Film Company) to reduce reflections. The wheel was a ridged rubber
Lego wheel affixed to a rotary encoder (Kubler 05.2400.1122.0360). A
plastic tube for delivery of water rewards was placed near the subject’s
mouth. Licking behaviour was monitored by attaching a piezo film
(TE Connectivity, CAT-PFS0004) to the plastic tube and recording
its voltage. Experiments were run with Rigbox software for MATLAB
(Mathworks, Inc.). Full details of the experimental apparatus
including detailed parts list can be found at
http://www.ucl.ac.uk/cortexlab/tools/wheel.

A trial was initiated after the subject had held the wheel still for a
short interval (duration uniformly distributed between 0.2 and 0.5 s on
each trial; Fig. 1b). At trial initiation, visual stimuli were presented at
the centre of the left and right screens, or directly left and right of the
subject. These stimulus locations were in the centre of the monocular
zones of the mouse’s visual field so that no eye or head movements 
were required for the mice to see them. The stimulus was a Gabor patch 
with orientation 45°, sigma 9 d.v.a., and spatial frequency 0.1 cycles 
per degree. After stimulus onset there was a random delay interval of 
0.5-1.2 s, during which time the subject could turn the wheel without 
penalty, but visual stimuli were locked in place and rewards could not 
be earned. The mice nevertheless typically responded immediately to 
the stimulus onset. At the end of the delay interval, an auditory tone 
cue was delivered (8 kHz pure tone for 0.2 s) after which the visual 
stimulus position became coupled to movements of the wheel. Wheel 
turns in which the top surface of the wheel was moved to the subject’s 
right led to rightward movements of stimuli on the screen, that is, a 
stimulus on the subject’s left moved towards the central screen. Put 
another way, clockwise turns of the wheel, from the perspective of the 
mouse, led to clockwise movement of the stimuli around the subject. 
A left or right turn was registered when the wheel was turned by an 
amount sufficient to move the visual stimuli by 90 d.v.a. in either direc- 
tion (approximately 20 mm of movement of the surface of the wheel). 
When at least one stimulus was presented, the subject was rewarded 
for driving the higher contrast visual stimulus to the central screen (if 
both stimuli had equal contrast, left and right turns were rewarded 
with 50% probability). When no stimuli were presented, the subject 
was rewarded if no turn was registered during the 1.5 s following the 
Go cue. There were therefore three trial outcomes that could lead to 
reward depending on the stimulus condition (left turn, right turn, no 
turn), and in this sense the task was a three-alternative task. Immediately 
following registration of a choice or expiry of the 1.5 s window, feedback 
was delivered. If correct, feedback was a water reward (2-3 µl) delivered
by the opening of a valve on the water tube for a calibrated duration. If
incorrect, feedback was a white noise sound played for 1 s. During the
1 s feedback period, the visual stimulus remained on the screen. After a
subsequent inter-trial interval of 1 s, the mouse could initiate another
trial by again holding the wheel still for the prescribed duration.

Trials of different contrast conditions were randomly interleaved. 
The experimenter was not blinded to contrast condition either during 
data acquisition or during analysis. 


Training protocol 
Mice were trained on this task with the following shaping protocol. First, 
high-contrast stimuli (50 or 100%) were presented only on the left or
the right, with an unlimited choice window, and repeating trial
conditions following incorrect choices (‘repeat on incorrect’). Once mice
achieved high accuracy and initiated movements rapidly (approximately
70 or 80% performance on non-repeat trials, and with reaction
times nearly all <1 s, but at the experimenter’s discretion), trials with
no stimuli were introduced, again repeating on incorrect. Once
subjects responded accurately on these trials (70 or 80% performance,
at experimenter’s discretion), lower contrast trials were introduced
without repeat on incorrect. Finally, contrast comparison trials were 
introduced, starting with high versus low contrast, then high versus 
medium and medium versus low, then trials with equal contrast on both 
sides. The final proportion of trials presented was weighted towards 
easy trials (high contrast versus zero, high versus low, medium versus 
zero, and no-stimulus trials) to encourage high overall reward rates 
and sustained motivation. 

On most trials for which subjects eventually made a left or right 
turn by the end of the trial, the subjects responded immediately to 
the stimulus presentation, turning the wheel within 400 ms of stimulus
appearance (64.9 ± 14.0% (mean ± s.d.), n = 39 sessions), nearly always
in the same direction as their final choice (96.6 ± 3.4%). For this study,
data analyses focused on this initial 400 ms period, and we defined
left and right choice trials as those in which this period contained the
onset of a clockwise or counterclockwise turn of sufficient amplitude
(90 d.v.a.), and NoGo trials as those in which it contained no detectable
movement. To exclude trials in which wheel turns were coincidentally
made before subjects could respond to the stimuli, only trials with
movement onset between 125 and 400 ms post-stimulus onset, or with
no movement of any kind during the window from −50 to 400 ms
post-stimulus onset, were included. Trials with other movements, that were
detectable but would not have resulted in registering a choice by the 
end of the movement, were excluded. 

The algorithm for detecting wheel movement onsets (‘findWheelMoves3’,
https://github.com/cortex-lab/wheelAnalysis/blob/master/+wheel/findWheelMoves3.m)
was designed to identify the earliest moment at which the wheel began
detectably moving. First, non-movement periods were identified as those
which had less than 1.1 mm of wheel movement over 0.25 s duration. Next,
a morphological closure (dilation then erosion) of these periods was
performed with size 0.1 s to remove gaps smaller than this. Finally, the
timing of the ends of the non-movement periods was refined by looking
sequentially backwards in time to identify the first moment at which the
position deviated by more than a smaller threshold of 0.2 mm. The
double-threshold procedure (first 1.1 mm, then looking backwards for 0.2 mm)
was necessary because 0.2 mm is just two units of the rotary encoder’s 
measurement, and these two-unit steps could happen owing to noise 
at any time. In this way, the movement onsets (and consequently the 
reaction times) were measured at the resolution of the rotary encoder. 
Smaller detection thresholds would lead to earlier detection of wheel 
turns, but potentially at the risk of false-positive detections. To assess 
how our detector performed, we decoded the subject’s choice from the 
instantaneous wheel velocity (difference in wheel position between t 
and t-10 ms) at different times relative to detected movement onset 
(Extended Data Fig. 1q). The decoder performed essentially at chance 
50 ms before movement onset (47.7%) compared to near perfect per- 
formance 50 ms after movement onset (94.6%). 
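
The double-threshold logic can be illustrated with a minimal Python sketch. This is not the findWheelMoves3 implementation: the function name, the forward alignment of the 0.25 s window, and the handling of the trace edges are assumptions made for illustration, and the input is taken to be a list of wheel positions in mm sampled at a fixed rate.

```python
def detect_wheel_onsets(pos_mm, fs_hz, big_mm=1.1, small_mm=0.2,
                        still_s=0.25, close_s=0.1):
    """Return sample indices of movement onsets in a wheel-position trace."""
    n = len(pos_mm)
    w = int(round(still_s * fs_hz))
    # Step 1: a sample is "still" if the wheel travels < big_mm over the
    # following still_s seconds (the window alignment is an assumption).
    still = [False] * n
    for i in range(n - w):
        seg = pos_mm[i:i + w + 1]
        still[i] = (max(seg) - min(seg)) < big_mm
    # Step 2: morphological closure (dilation, then erosion) of the still
    # mask removes gaps shorter than about close_s.
    h = int(round(close_s * fs_hz)) // 2
    dilated = [any(still[max(0, i - h):i + h + 1]) for i in range(n)]
    still = [all(dilated[max(0, i - h):i + h + 1]) for i in range(n)]
    onsets = []
    for e in range(n - 1):
        if not (still[e] and not still[e + 1]):
            continue  # e is not the last sample of a still period
        ref = pos_mm[e]
        # First later sample that clears the large threshold.
        c = next((j for j in range(e + 1, n)
                  if abs(pos_mm[j] - ref) >= big_mm), None)
        if c is None:
            continue  # the trace ended without a confirmed movement
        # Step 3: walk backwards while the small threshold is still exceeded.
        j = c
        while j - 1 > e and abs(pos_mm[j - 1] - ref) > small_mm:
            j -= 1
        onsets.append(j)
    return onsets


# Synthetic 1 kHz trace: 1 s still, a 0.2 s ramp at 50 mm/s, then still again.
pos = [0.0] * 1000 + [(i - 999) / 20.0 for i in range(1000, 1200)] + [10.0] * 800
pos[500] = 0.2  # isolated two-unit encoder blip during stillness
print(detect_wheel_onsets(pos, 1000.0))
```

On this synthetic trace the isolated 0.2 mm blip is ignored, and the single reported onset is the first sample at which the ramp has moved the wheel by more than 0.2 mm.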

Behavioural trials when the mouse was disengaged were excluded 
from analysis. These trials were defined as Miss trials (stimulus present 
but wheel not turned) preceded by two or more other Miss trials, as 
well as all NoGo trials occurring consecutively at the end of the session. 

When analysing activity following reward delivery (Extended Data 
Fig. 4e), only correct NoGo trials were included, that is, trials with no 
visual stimulus and no wheel movement. 

Sessions were included when at least 12 trials of each type (left, right 
and NoGo) could be included for analysis, and when anatomical locali- 
zation was sufficiently confident (see below). 

For analyses requiring matching stimulus contrasts across trials with 
different choices, we considered all trials with contralateral stimulus 
contrast greater than zero, and split them by low, medium, and high 
contralateral contrast. For each contrast level, we counted the number 
of trials with that contrast and each response type (left or right; NoGo; 
or passive condition). We took the minimum of these three numbers, 
and selected that many trials randomly from each group. This resulted 
in three sets of trials (trials with left or right choices; trials with NoGos;
and trials in the passive condition), which each contained low, medium
and high contralateral contrasts but which all contained exactly the 
same numbers of each contrast. When fewer than 10 such trials could 
be found, the session was excluded for the matched-contrast analyses 
(n = 34 of 39 sessions included).
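
The matching procedure amounts to subsampling each response group down to the smallest per-contrast count. A minimal sketch follows, assuming trials are supplied as (trial_id, contrast_level, group) tuples; the function name and the string labels are hypothetical and chosen only for illustration.

```python
import random

GROUPS = ("choice", "nogo", "passive")
LEVELS = ("low", "medium", "high")


def match_contrasts(trials, seed=0):
    """Subsample trials so every group has the same count at each contrast.

    trials: iterable of (trial_id, contrast_level, group) tuples, with
    contrast_level in LEVELS and group in GROUPS (hypothetical labels).
    Returns a dict mapping group -> list of (trial_id, contrast_level).
    """
    rng = random.Random(seed)
    trials = list(trials)
    matched = {g: [] for g in GROUPS}
    for level in LEVELS:
        pools = {g: [tid for tid, lev, grp in trials
                     if lev == level and grp == g]
                 for g in GROUPS}
        # The minimum of the three counts sets how many trials are kept.
        k = min(len(pool) for pool in pools.values())
        for g in GROUPS:
            matched[g].extend((tid, level) for tid in rng.sample(pools[g], k))
    return matched
```

Under this sketch, a caller would compare `len(matched["choice"])` with the 10-trial minimum to decide whether the session enters the matched-contrast analyses.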


Video monitoring 

Eye and body movements were monitored by illuminating the subject 
with infrared light (830 nm, Mightex SLS-0208-A). The right eye was 
monitored with a camera (The Imaging Source, DMK 23U618) fitted
with zoom lens (Thorlabs MVL7000) and long-pass filter (Thorlabs
FEL0750), recording at 100 Hz. Body movements were monitored
with another camera (same model but with a different lens, Thorlabs 
MVL16M23) situated above the central screen, recording at 40 Hz. 


Neuronal recordings 

Recordings were made using Neuropixels (Phase3A Option 3) electrode
arrays, which have 384 selectable recording sites out of 960 sites on a
1-cm shank. Probes were mounted to a custom 3D-printed polylactic
acid piece and affixed to a steel rod held by a micromanipulator (uMP-4,
Sensapex Inc.). To allow later track localization, before insertion probes
were coated with a solution of DiI (ThermoFisher Vybrant V22888 or
V22885) by holding 2 µl in a droplet on the end of a micropipette and
touching the droplet to the probe shank, letting it dry, and repeating 
until the droplet was gone, after which the probe appeared pink. 

On the day of recording or within two days before, mice were briefly 
anaesthetized with isoflurane while one or more craniotomies were 
made, either with a dental drill or a biopsy punch. After at least three 
hours of recovery, mice were head-fixed in the setup. Probes had a 
soldered connection to short external reference to ground; the ground 
connection at the headstage was subsequently connected to an Ag/AgCl 
wire positioned on the skull. The craniotomies as well as the wire were
covered with saline-based agar. The agar was covered with silicone oil to 
prevent drying. In some experiments, a saline bath was used rather than 
agar. Two or three probes were advanced through the agar and through 
the dura, then lowered to their final position at approximately 10 µm/s.
Electrodes were allowed to settle for around 15 min before starting 
recording. Recordings were made in external reference mode with local 
field potential gain = 250 and action potential gain = 500. Recordings 
were repeated at different locations on each of multiple subsequent 
days (Supplementary Table 2), performing new craniotomy proce- 
dures as necessary. All recordings were made in the left hemisphere. 
The ability of a single probe to record from multiple areas, and the 
use of multiple probes simultaneously, led to a number of areas being
recorded simultaneously in each session (Supplementary Table 3). 


Passive stimulus presentation 

After each behaviour session, we performed a passive replay experi- 
ment while continuing to record from the same electrodes. Mice were 
presented with two types of sensory stimuli without possibility of 
receiving reward for any behaviour: replay of task stimuli; and sparse 
flashed visual stimuli for receptive field mapping. 

The replayed task stimuli were: left and right visual stimuli of each 
contrast; some combinations of left and right visual stimuli simultane- 
ously; Go cue beeps; white noise bursts; and reward valve clicks (but 
with a manual valve closed so that no water was delivered). These stimuli
were replayed at randomized intervals of 1-2 s for 10 or 25 randomly
interleaved repetitions each.

Receptive fields were mapped with white squares of 8 d.v.a. edge 
length, positioned on a 10 × 36 grid (some stimulus positions were
located partially off-screen) on a black background. The stimuli were 
shown for 10 monitor frames (167 ms) at a time, and their times of 
appearance were independently randomly selected to yield an aver- 
age rate of approximately 0.12 Hz. 


Data analysis 

The data were automatically spike sorted with Kilosort
(https://github.com/cortex-lab/Kilosort) and then manually curated with
the phy GUI (https://github.com/kwikteam/phy). Extracellular voltage
traces were preprocessed with common-average referencing: subtracting each
channel’s median to remove baseline offsets, then subtracting the 
median across all channels at each time point to remove artefacts. 
During manual curation, each set of events (unit) detected by a par- 
ticular template was inspected and if the events comprising the unit 
were judged to correspond to noise (zero or near-zero amplitude; 
non-physiological waveform shape or pattern of activity across channels)
rather than spikes, the entire unit was discarded. Units containing
low-amplitude spikes, spikes with inconsistent waveform shapes, and/ 
or refractory period contamination were labelled as ‘multi-unit activity’ 
and not included for further analysis. Finally, each unit was compared to 
similar, spatially neighbouring units to determine whether they should 
be merged, on the basis of spike waveform similarity, drift patterns or 
cross-correlogram features. Units were also excluded if their average 
rate in the analysis window (stimulus onset to 0.4 s after; trial firing 
rate) was less than 0.1 Hz. Units passing these criteria were considered 
to reflect the spiking activity of a neuron. 
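
The common-average referencing step described at the start of this section can be sketched as two median subtractions. The function name and the list-of-lists layout (one list of samples per channel) are assumptions for illustration; a real pipeline would operate on large arrays in chunks.

```python
from statistics import median


def common_average_reference(traces):
    """Median-based common-average referencing.

    traces: list of channels, each a list of voltage samples of equal length.
    Returns a new list of channels with offsets and shared artefacts removed.
    """
    # Subtract each channel's own median to remove its baseline offset.
    centred = []
    for channel in traces:
        offset = median(channel)
        centred.append([v - offset for v in channel])
    # Subtract the across-channel median at every time point to remove
    # artefacts that are common to all channels.
    n_samples = len(centred[0])
    reference = [median(ch[t] for ch in centred) for t in range(n_samples)]
    return [[ch[t] - reference[t] for t in range(n_samples)] for ch in centred]
```

For example, three channels with different offsets and a shared transient at one time point come out with the transient removed and only the channel-specific signal remaining.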

Neurons were only included for further analysis when at least 13
neurons passing the above criteria were identified as coming from the
same brain region, in the same experiment. Furthermore, brain regions
were only included for which recordings from at least two subjects had 
sufficient numbers of neurons. 

To determine whether a neuron’s firing rate was significantly modulated
during the task (Supplementary Fig. 1), a set of six statistical tests
was used to detect changes in activity during various task epochs
and conditions: (1) Wilcoxon sign-rank test between trial firing rate
(rate of spikes between stimulus onset and 400 ms post-stimulus)
and baseline rate (defined in period −0.2 to 0 s relative to stimulus
onset on each trial); (2) sign-rank test between stimulus-driven rate
(firing rate between 0.05 and 0.15 s after stimulus onset) and baseline
rate; (3) sign-rank test between pre-movement rates (−0.1 to 0.05 s
relative to movement onset) and baseline rate (for trials with
movements); (4) Wilcoxon rank-sum test between pre-movement rates on left
choice trials and those on right choice trials; (5) sign-rank test between
post-movement rates (−0.05 to 0.2 s relative to movement onset) and
baseline rate; (6) rank-sum test between post-reward rates (0 to 0.15
s relative to reward delivery for correct NoGos) and baseline rates. A
neuron was considered active during the task, or to have detectable
modulation during some part of the task, if any of the P values on these
tests were below a Bonferroni-corrected alpha value (0.05/6 = 0.0083).
However, because the tests were coarse and would be relatively
insensitive to neurons with transient activity, a looser threshold was used to
determine the neurons included for statistical analyses (Figs. 3-5): if
any of the first four tests (that is, those concerning the period between
stimulus onset and movement onset) had a P value less than 0.05.
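
The two decision rules can be written compactly once the six P values are available. The sketch below assumes the P values have already been computed and are ordered as tests (1) to (6); the function name is hypothetical.

```python
def classify_modulation(p_values, alpha=0.05):
    """Apply the two thresholds to six P values ordered as tests (1)-(6).

    Returns (active, included):
      active   - any test below the Bonferroni-corrected alpha (alpha / 6)
      included - any of the first four tests below the uncorrected alpha
    """
    if len(p_values) != 6:
        raise ValueError("expected six P values, one per test")
    bonferroni_alpha = alpha / len(p_values)  # 0.05 / 6, about 0.0083
    active = any(p < bonferroni_alpha for p in p_values)
    included = any(p < alpha for p in p_values[:4])
    return active, included
```

Note that the two criteria are not nested: a neuron whose only small P value comes from test (5) or (6) counts as active without entering the statistical analyses.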

In determining the neurons statistically significantly responding 
during different task conditions (Figs. 2d-h, right sub-panels, 5b), the 
mean firing rate in the post-stimulus window (0 to 0.25 s), taken across
trials of the desired condition, was z-scored relative to trial-by-trial
baseline rates (from the window −0.1 to 0 s) and taken as significant
when this value was >4 or <−4, equivalent to a two-sided t-test at P < 10^−4.
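
As a quick check on the |z| > 4 criterion, the corresponding two-sided tail probability can be computed under a standard normal approximation (an assumption; with many trials a t distribution is close to normal). The helper name is hypothetical.

```python
import math


def two_sided_p_from_z(z):
    """Two-sided tail probability of a standard normal variable at |z|."""
    return math.erfc(abs(z) / math.sqrt(2.0))


p_at_threshold = two_sided_p_from_z(4.0)
print(f"{p_at_threshold:.2e}")
```

The value is roughly 6 × 10^−5, so a |z| > 4 threshold is slightly stricter than a two-sided criterion of 10^−4.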

For visualizing firing rates (Extended Data Fig. 4), the activity of
each neuron was binned at 0.005 s, smoothed with a causal half-Gaussian
filter with standard deviation 0.02 s, averaged across trials,
smoothed with another causal half-Gaussian filter with standard
deviation 0.03 s, baseline subtracted (baseline period −0.02 to 0 s
relative to stimulus onset, including all trials in the task), and divided
by baseline + 0.5 spikes/s. Neurons were selected for display if they had
a significant difference between firing rates on trials with both stimuli
and movements versus trials with neither, using a sliding window 0.1
s wide and in steps of 0.005 s (rank-sum P < 0.0001 for at least three
consecutive bins).
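
The smoothing and normalization chain can be sketched as follows. The function names, the kernel truncation at four standard deviations, and the convention that the first four bins of each trial form the 0.02 s baseline are assumptions for illustration; near the start of a trace the causal filter sees fewer past bins, an edge effect this sketch does not correct.

```python
import math


def half_gaussian_kernel(sd_s, bin_s, n_sd=4.0):
    """Causal half-Gaussian weights for lags 0, 1, 2, ... bins (sum to 1)."""
    n = int(round(n_sd * sd_s / bin_s)) + 1
    weights = [math.exp(-0.5 * (i * bin_s / sd_s) ** 2) for i in range(n)]
    total = sum(weights)
    return [w / total for w in weights]


def causal_smooth(x, kernel):
    """y[t] = sum_i kernel[i] * x[t - i], using only current and past bins."""
    return [sum(kernel[i] * x[t - i] for i in range(min(len(kernel), t + 1)))
            for t in range(len(x))]


def normalized_psth(trial_rates, bin_s=0.005, n_baseline_bins=4):
    """Smooth, average, smooth again, then baseline-normalize.

    trial_rates: list of trials, each a list of binned rates in spikes/s,
    whose first n_baseline_bins bins precede stimulus onset (assumption).
    """
    k1 = half_gaussian_kernel(0.02, bin_s)
    k2 = half_gaussian_kernel(0.03, bin_s)
    smoothed = [causal_smooth(trial, k1) for trial in trial_rates]
    n_bins = len(smoothed[0])
    mean = [sum(trial[t] for trial in smoothed) / len(smoothed)
            for t in range(n_bins)]
    mean = causal_smooth(mean, k2)
    baseline = sum(mean[:n_baseline_bins]) / n_baseline_bins
    # The added 0.5 spikes/s keeps the ratio finite for silent baselines.
    return [(v - baseline) / (baseline + 0.5) for v in mean]
```

Because the kernel has weight only at non-negative lags, a spike can influence the smoothed rate at its own bin and later bins but never earlier ones, which preserves the timing of response onsets.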

Visual receptive fields (Extended Data Fig. 2d) were determined by 
sparse noise mapping outside the context of the behavioural task. The 
evoked rates for each presentation were measured as the spike count in
the 200 ms following stimulus onset. The rates evoked by stimuli at the 
peak location and surrounding four nearest locations were combined 
and compared to the rates for all locations >45 d.v.a. from the peak
location using a Wilcoxon rank-sum test. Any neurons for which the P value
of the test was less than 10° were counted as having a significant visual 
receptive field. Note that neurons are included in analyses regardless
of receptive field location; in particular, recorded LGd neurons did not
have receptive field locations overlapping with task stimuli. 


Kernel regression analysis 

To identify choice-selective neurons, we began by fitting a
kernel-regression model. In this analysis, the firing rate of each neuron
is described as a linear sum of temporal filters aligned to task events.
For the current study, only visual stimulus onset and wheel movement
onset kernels were required, since we consider here only the period in
between the two. In the model, the predicted firing rate $f_n(t)$ for neuron
$n$ is given as

$$f_n(t) = \sum_{c=1}^{6} \sum_{s \in S_c} K_{c,n}(t - s) + \sum_{m \in M} K_{A,n}(t - m) + \sum_{m \in M} D_m K_{C,n}(t - m)$$

Here, $c$ represents the six stimulus types (contralateral low, medium,
or high, or ipsilateral low, medium, or high), $S_c$ represents the set of
times at which this contrast appeared, and $K_{c,n}(t)$ represents the Vision
kernel function of this contrast for neuron $n$. $M$ represents the set of
movement times and $K_{A,n}(t)$ represents the Action kernel for neuron
$n$; $D_m$ represents the direction of movement $m$ (encoded as ±1), and
$K_{C,n}(t)$ represents the Choice kernel for neuron $n$. The Vision kernels
$K_{c,n}(t)$ are supported over the window −0.05 to 0.4 s relative to stimulus
onset, and the Action and Choice kernels are supported over the window
−0.25 to 0.025 s relative to movement onset. Before estimating the
kernels, the discretized firing rates $f_n(t)$ for each neuron were estimated
by binning spikes into 0.005-s bins and then smoothing with a causal
half-Gaussian filter with standard deviation 0.025 s. The Vision kernels
therefore contain $L_c = 90$ time bins, whereas Action and Choice kernels
contain $L_d = 55$ time bins.

The large number of parameters to be fit, combined with the relatively
small number of trials of each type, poses a challenge for estimation.
We devised a solution to this problem that makes use of the large
number of neurons recorded using reduced-rank regression (Extended 
Data Fig. 5b), which we found to give better cross-validated results 
(see next section). 

First, for each kernel to be fit, we construct a Toeplitz predictor
matrix (Extended Data Fig. 5d). For stimuli of contrast $c$, we define a
Toeplitz predictor matrix $P_c$ of size $T \times L_c$, in which $T$ is the
total number of time points in the training set, and $L_c$ is the number of
lags required for the Vision kernels. The predictor matrix contains
diagonal stripes starting each time a visual stimulus of contrast $c$ is
presented: $P_c(t, i) = 1$ if $t - i \in S_c$ and 0 otherwise. Predictor
matrices of size $T \times L_d$ were defined similarly for the Action and
Choice kernels, and the six stimulus predictor matrices and two movement
predictors are horizontally concatenated to yield a global prediction
matrix $P$ of size $T \times 650$. The total length of all kernels for one
neuron is $650 = 6L_c + 2L_d$.
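
The construction of one Toeplitz block and the horizontal concatenation can be sketched as below. The function names are hypothetical, event times are assumed to be already converted to bin indices, and the fact that the kernel windows start before the event (which would shift the lag origin) is ignored for simplicity.

```python
def toeplitz_predictor(event_bins, n_time, n_lags):
    """Return P as a list of rows with P[t][i] = 1 if bin (t - i) holds an event.

    Each event therefore produces a diagonal stripe of ones starting at the
    event bin and running for n_lags rows.
    """
    events = set(event_bins)
    return [[1 if (t - i) in events else 0 for i in range(n_lags)]
            for t in range(n_time)]


def hstack(blocks):
    """Concatenate predictor blocks (lists of rows) column-wise."""
    n_time = len(blocks[0])
    return [sum((block[t] for block in blocks), []) for t in range(n_time)]


# Toy example: events in bins 2 and 5, 8 time bins, 3 lags.
for row in toeplitz_predictor([2, 5], n_time=8, n_lags=3):
    print(row)
```

With six stimulus blocks of 90 lags and two movement blocks of 55 lags, the concatenated matrix has 6 × 90 + 2 × 55 = 650 columns, matching the total kernel length given in the text.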

The simplest approach to fit the kernel shapes would be to minimize 
the squared error between true and predicted firing rate using linear 
regression. To do this, we would horizontally concatenate the rate
vectors of all $N$ neurons together into a $T \times N$ matrix $F$, and
estimate the kernels for each neuron by finding a matrix $K$ of size
$650 \times N$ to minimize the squared error $E = \|F - PK\|^2$, using the
total sum square (Frobenius) norm. However, as each $k_n$ has 650
parameters, linear regression results in noisy and overfit kernels when
fit to a single neuron, particularly given the high trial-to-trial
variability of neuronal firing. Although expressing the kernels as a sum
of basis functions can reduce the number of required parameters, the
success of this
method depends strongly on the choice of basis functions, and the 
appropriate choice will differ depending on properties of the task and 
stimuli. The large number of neurons in the current dataset enables an 
alternative approach. 

This approach is based on reduced-rank regression, which allows
regularized estimation by factorizing the kernel matrix $K$ into the
product of a $650 \times r$ matrix $B$ and an $r \times N$ matrix $W$,
minimizing the total error: $E = \|F - PBW\|^2$. The $T \times r$ matrix $PB$
may be considered as a set of temporal basis functions, which can be
linearly combined to estimate each neuron’s firing rate over the whole
training set. Reduced-rank regression ensures that these basis functions
are ordered, so that predicting population activity from only the first
$r$ columns will result
in the best possible prediction from any rank $r$ matrix.
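
One standard closed-form solution to reduced-rank regression takes the ordinary least-squares fit and keeps the leading singular directions of the fitted values. The sketch below uses that textbook solution with NumPy; it is an assumption that this matches the solver used here, and the function name is hypothetical.

```python
import numpy as np


def reduced_rank_regression(P, F, r):
    """Factorize the kernel matrix as K = B @ W with rank r.

    P: (T, p) predictor matrix; F: (T, N) firing rates; r: target rank.
    Returns B of shape (p, r) and W of shape (r, N) minimizing
    ||F - P @ B @ W||^2 (Frobenius norm) over rank-r factorizations.
    """
    # Full-rank least-squares kernels, one column per neuron.
    K_ols, *_ = np.linalg.lstsq(P, F, rcond=None)
    fitted = P @ K_ols
    # Right singular vectors of the fitted values, ordered by the amount
    # of population activity each direction accounts for.
    _, _, Vt = np.linalg.svd(fitted, full_matrices=False)
    V_r = Vt[:r].T            # (N, r)
    B = K_ols @ V_r           # (p, r): columns of P @ B are the basis functions
    W = V_r.T                 # (r, N): per-neuron weights on those functions
    return B, W
```

Because the singular vectors are ordered, the first r columns of `P @ B` computed at a larger rank are the same basis functions that a rank-r fit would return, which is the ordering property relied on in the text.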

To estimate each neuron’s kernel functions, we estimated a weight
vector $w_n$ to minimize an error $E_n = \|f_n - PBw_n\|^2$ for each neuron
with elastic net regularization (using the package cvglmnet for Matlab
(http://www.stanford.edu/~hastie/glmnet_matlab/) with parameters
$\alpha = 0.5$ and $\lambda = 0.5$), and used cross-validation to determine the
optimal number of columns $r_n$ of $PB$ to keep when predicting neuron $n$.
The kernel functions for neuron $n$ were then unpacked from the
650-dimensional vector obtained by multiplying the first $r_n$ columns of
$B$ by $w_n$. Neurons
with total cross-validated variance explained of <2% were excluded
from analyses.


Comparison of reduced-rank model to alternative models 

To demonstrate the validity of this reduced-rank kernel method, we
compared its performance to two alternative approaches to spike train
prediction: (1) fitting regression to the Toeplitz predictor matrix
directly, and (2) a model with raised cosine basis functions (Extended
Data Fig. 5d).

The Toeplitz predictor matrix was the matrix P described above. 
The cosine predictor matrix was constructed similarly, but with each 
row containing a raised cosine of width 100 ms, and spaced by 25 ms. 
The cosine predictor matrix therefore contained $L_c = 18$ rows for each
of the 6 contrasts, and $L_d = 11$ time rows for each movement kernel, for
130 predictors in total (Extended Data Fig. 5d). 

To evaluate the performance of these three methods, we fit regres- 
sion weights with elastic net regression as described above, and evalu- 
ated performance as the percentage of variance explained (fivefold 
cross-validation across trials). To compare models fairly, the number 
of columns included in the reduced-rank model was not allowed to 
vary per neuron as described above, but was instead fixed at n=18 
components. The reduced-rank projection matrix B was not itself 
cross-validated. The reduced-rank method outperformed both other 
methods (Extended Data Fig. 5e).

To estimate the degree to which differing performance of the models 
arose from over- versus under-fitting, we computed the ‘proportion 
of overfit explained variance’ as 


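$$\frac{\mathrm{CV}_{\mathrm{train}} - \mathrm{CV}_{\mathrm{test}}}{\mathrm{CV}_{\mathrm{train}}}$$


where $\mathrm{CV}_{\mathrm{train}}$ is the training-set variance explained and $\mathrm{CV}_{\mathrm{test}}$ is the
test-set variance explained, thus quantifying the difference between the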
two (a measure of overfitting) relative to the total variance explained 
(Extended Data Fig. 5f). 


Determining individual neuron selectivity 
To assess the selectivity of individual neurons for each kernel, we used
a nested approach. We first fit the activity of each neuron using the
reduced-rank regression procedure above (including deriving a new
basis set), but excluding the kernel to be tested. We subtracted this
prediction from the raw data to yield residuals, representing aspects
of the neuron’s activity not explainable from the other kernels. We then
repeated the reduced-rank regression procedure one more time, using
the residual firing rates as the dependent variable, and using only the
test kernel. The cross-validated quality of this fit determined the vari- 
ance explainable only by the test kernel. If this variance explained was 
>2%, the neuron was deemed selective for that kernel and was included 
in Fig. 3d,e or Fig. 4b. 

In principle, inaccuracies in the model fits from one kernel could leave
variability to be explained by another correlated variable, resulting in
false positives from this test. For instance, since motor actions are
correlated with visual stimuli, activity related to task-unrelated movements
(for example, as reported previously) could appear to be visually related
activity if it was not accurately captured by the Action and Choice
kernels. However, we consider it unlikely that a significant proportion
of the contralateral vision correlates reported here arise from such a
confound, since the same argument should apply to ipsilateral vision 
correlates, and we found very little of such correlates (Extended Data 
Fig. 5g). 

Another potential source of error is that neurons were chosen for 
inclusion in the analysis based on criteria that could relate to the results
of the analysis. Specifically, one of the criteria by which a neuron could
be included was the observation of significant differences between left
and right choice trials (see above). However, our empirically estimated
false discovery rate is very low (0.3%, Extended Data Fig. 5h), and it
should not differ between brain regions. Thus, the observation that
choice-selective neurons were not found in most brain regions studied
is an internal control, showing that false discovery of neurons based on
analysis inclusion criteria cannot account for our findings. 


Estimation of false-positive rate for determining individual 
neuron selectivity with reduced-rank kernel regression 

To choose the threshold for counting a neuron as selective, we searched
for a value giving low false-positive error rates for choice selective
neurons. To estimate false-positive rates, we performed a shuffle analysis,
relabelling each trial with a left or right choice with a randomly drawn 
choice from another left or right choice trial, without replacement. 
The analysis was then repeated from the start, including fitting the 
reduced-rank regression and the cross-validated nested model. We 
selected the threshold for counting a neuron as choice selective to 
ensure a low false-positive error rate as assessed with this measure 
(0.33%; Extended Data Fig. 5h). 
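
The relabelling step and the resulting false-positive estimate can be sketched as follows. It is assumed here that drawing choices from other trials without replacement is equivalent to permuting the left and right labels among choice trials; the function names, the 'L'/'R'/None encoding, and the per-neuron score list are hypothetical, and the model refitting itself is not shown.

```python
import random


def shuffle_choice_labels(choices, seed=None):
    """Permute 'L'/'R' labels among choice trials; other trials are untouched.

    choices: list with 'L', 'R', or None (no choice) for each trial.
    """
    rng = random.Random(seed)
    choice_idx = [i for i, c in enumerate(choices) if c in ("L", "R")]
    labels = [choices[i] for i in choice_idx]
    rng.shuffle(labels)  # sampling without replacement is a permutation
    shuffled = list(choices)
    for i, label in zip(choice_idx, labels):
        shuffled[i] = label
    return shuffled


def false_positive_rate(shuffled_scores, threshold):
    """Fraction of neurons whose shuffled-data score exceeds the threshold."""
    n_exceed = sum(1 for s in shuffled_scores if s > threshold)
    return n_exceed / len(shuffled_scores)
```

In use, the full selectivity analysis would be rerun on the shuffled labels, and the per-neuron variance explained by the Choice kernel passed to `false_positive_rate` with the candidate threshold.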


Population decoding of task correlates 

To perform population decoding (Fig. 4c; Extended Data Fig. 5g), we 
began with the residual firing rates produced as described above, pro- 
duced by fitting without a test kernel. We then split trials in a binary 
fashion: trials that had versus those that did not have an ipsilateral 
stimulus; those having versus not having a contralateral stimulus; those 
having versus not having any movement (either left or right); those having
left choice versus having right choice (considering only trials with
one of the two). We identified a population coding direction encoding 
the difference between the two sets of trials, by fitting an L1-regularized 
logistic regression on data from training trials, using the period 0.05 
to 0.15 s relative to stimulus onset for Vision decoding, and the period
−0.05 to 0 s relative to movement onset for Action or Choice decoding.
We then predicted the binary category of test data by projecting firing 
rates from test set trials, from each time point during the trial, onto 
the weight vector of the logistic regression. Although it is in principle
possible that these signals are encoded in a nonlinearly separable way,
the robust predictions obtained suggest information can be read out 
linearly. The population decoding was taken as the difference between 
projections between test-set trials of each binary category. For Action 
decoding, where trials with left or right choices were compared to those 
with neither, a ‘movement onset time’ was chosen for trials without a
movement randomly from the distribution of movement onset times 
on left and right choice trials. 

To statistically compare decoding time course across areas, we took 
the population decoding score from each key area in each recording 
(n = 29 populations from frontal cortex including MOs, PL and MOp;
n = 29 populations from midbrain including SCm, MRN, SNr and ZI;
n = 5 populations from striatum, CP), and normalized each so the mean across
recordings within an area was 1 at choice time. We then performed a two-way
ANOVA, with factors time relative to movement onset and area (frontal, 
midbrain and striatum). We found a significant effect of time on choice
decoding (50 d.f., F=10.43, P<10”) but no significant effect of area (2
d.f., F=0.28, P> 0.05) and no significant interaction between time and 
area (100 d.f., F=0.12, P> 0.05). 


jPECC analysis 

To perform jPECC analysis (Fig. 4d, Extended Data Fig. 8), we took 
spike counts of individual neurons from simultaneously recorded 
regions, assembling neurons from all regions within a group into one 
population vector per region group. The three region groups were: VIS: 
VISp, VISpm, VISl, VISrl, VISa, VISam; midbrain: SCm, MRN, ZI, SNr; and
frontal: MOs, MOp, PL. One jPECC per pair of region groups was computed
per recording. Spikes were counted in 10-ms bins and smoothed
with a half-Gaussian causal filter with 25-ms standard deviation, and
normalized by dividing by baseline + 1 spike s⁻¹. Principal components
analysis was then performed for each region group, across time points 
and trials to reduce population activity to ten dimensions. Trials were 
divided tenfold into training and test sets. Canonical correlation analy- 
sis was performed on the training set PCs from each region group, L2 
regularized using λ = 0.5. The test-set PCs were projected onto the top
canonical dimension, and the Pearson correlation coefficient was com- 
puted between these projections across test-set trials. This process was 
repeated for each pair of time bins, creating a matrix of cross-validated 
correlation coefficients, with one entry per pair of time points relative 
to the event. When representing a single recording’s jPECC analysis, 
the statistical significance of these correlation coefficients was used 
to grey out non-significant regions (Extended Data Fig. 8b) but this 
value was not used in further analyses. 

To quantify lead-lag relationships across recordings, an asymmetry 
index was computed by diagonally slicing the jPECC matrix from −50 to
+50 ms relative to each time point. The average correlation coefficient
across the left half of this slice (that is, the average along a vector from
[t − 50, t + 50] to [t, t]) was subtracted from the average across the right half of this slice
(from [t, t] to [t + 50, t − 50]) to yield the asymmetry index for time point
t. This index was computed for each time point t relative to events and
the values across recordings were compared to 0 with a t-test.
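
The slicing described above can be illustrated with a minimal NumPy sketch. It is not the authors' code: the function name `asymmetry_index`, the convention that rows index time in the first region group and columns index time in the second, and the choice to include the centre point [t, t] in both halves are assumptions made for illustration.

```python
import numpy as np

def asymmetry_index(jpecc, bin_ms=10, half_width_ms=50):
    """Anti-diagonal asymmetry of a square jPECC matrix (illustrative sketch).

    jpecc[i, j] is assumed to hold the cross-validated correlation between
    region group 1 at time bin i and region group 2 at time bin j.
    """
    jpecc = np.asarray(jpecc, dtype=float)
    n = jpecc.shape[0]
    k = half_width_ms // bin_ms          # number of bins in each half slice
    offsets = np.arange(0, k + 1)        # 0..k bins away from the point [t, t]
    index = np.full(n, np.nan)           # undefined where the slice leaves the matrix
    for t in range(k, n - k):
        left = jpecc[t - offsets, t + offsets].mean()    # [t-50, t+50] .. [t, t]
        right = jpecc[t + offsets, t - offsets].mean()   # [t, t] .. [t+50, t-50]
        index[t] = right - left
    return index
```

One index per time point and recording would then be collected across recordings and compared to 0, for example with a one-sample t-test.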


Engagement index and pre-stimulus analyses 

To statistically compare pre-trial firing rates between the task and 
passive conditions (that is, between trials of active task performance, 
versus later passive stimulus replay, Extended Data Fig. 9a), we per- 
formed a nested multiple ANOVA test, in order to account for cor- 
related variability between neurons within recording sessions. Each 
observation was a neuron’s average measured pre-trial firing rate in 
the window between 250 and 50 ms before stimulus onset, log transformed
(log₁₀(x + 1 spike s⁻¹)) to make distributions approximately
normal. Any trials with detectable wheel movement in this interval 
were excluded. The ANOVA had three factors: active or passive condi- 
tion, recording session, and neuron identity (nested within recording 
session). A separate ANOVA was performed for each brain region, and 
the null hypothesis of no difference between baseline rates in active and 
passive conditions for neurons from a given brain region was rejected if
the P-value for the active/passive condition factor was less than 0.0012, 
that is, less than 0.05 after applying a Bonferroni correction for the 42 
brain regions tested. 

To compute the trial-by-trial ‘engagement index’, we took the dif- 
ference in pre-stimulus firing between the average of all task (‘active’) 
trials a and of all passive trials p, over the 200-ms period before stimulus
onset: 


$$x_n = \bar{f}_n^{\,a}(-0.2 < t < 0) - \bar{f}_n^{\,p}(-0.2 < t < 0)$$


This quantity was computed for each neuron in one of the areas with 
significant differences between task and passive determined by nested 
ANOVA analysis (319 ± 32.5 (mean ± s.e.m.) neurons included per session,
n = 34 sessions), accumulated into a vector x and normalized
to unit L2 magnitude for each session. To compute the engagement
index for each trial i, we computed the dot product x · f_i, where f_i is the
vector of pre-stimulus firing rates for trial i. For the summary
analysis in Fig. 5f, we then computed the mean across Go trials and the 
mean across Miss trials, and took the difference of the two. The Go and 
Miss trials included in this analysis were matched for contrast so that 
difference in visual drive could not influence the difference between 
trial types. To do this, N trials were selected from each contralateral
contrast condition, where N was the minimum of the number of trials
at that contrast condition having a Go outcome and the number having
a NoGo outcome.
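
As a rough illustration of the engagement index and the contrast matching described above, the following NumPy sketch may help. It is written under stated assumptions rather than taken from the authors' code: the function names, the array layouts (trials by neurons), and the use of random subsampling to pick the N matched trials are all hypothetical.

```python
import numpy as np

def engagement_axis(active_rates, passive_rates):
    """Unit-norm vector x of per-neuron (mean active - mean passive) pre-stimulus rates.

    Both inputs are assumed to be arrays of shape (n_trials, n_neurons).
    """
    x = np.mean(active_rates, axis=0) - np.mean(passive_rates, axis=0)
    return x / np.linalg.norm(x)          # normalize to unit L2 magnitude

def engagement_index(trial_rates, x):
    """Dot product of x with each trial's pre-stimulus rate vector f_i."""
    return np.asarray(trial_rates, dtype=float) @ x

def contrast_matched_difference(index, contrast, is_go, rng):
    """Mean index on Go trials minus mean index on non-Go trials, contrast matched.

    For each contrast, N = min(#Go, #non-Go) trials of each type are drawn
    at random (the random draw is an assumption of this sketch).
    """
    index = np.asarray(index, dtype=float)
    contrast = np.asarray(contrast)
    is_go = np.asarray(is_go, dtype=bool)
    go_vals, other_vals = [], []
    for c in np.unique(contrast):
        go = np.flatnonzero((contrast == c) & is_go)
        other = np.flatnonzero((contrast == c) & ~is_go)
        n = min(go.size, other.size)
        if n == 0:
            continue                      # this contrast cannot be matched
        go_vals.append(index[rng.choice(go, n, replace=False)])
        other_vals.append(index[rng.choice(other, n, replace=False)])
    return np.concatenate(go_vals).mean() - np.concatenate(other_vals).mean()
```

A call such as `contrast_matched_difference(idx, contrast, is_go, np.random.default_rng(0))` returns a single number per session.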


Measurement of pupil area and video motion energy 

We measured pupil area (Extended Data Fig. 9g, i, k) from the high-zoom 
videos of the subject’s eye, using DeepLabCut⁵². In approximately 200
training frames randomly sampled across all sessions, 4 points spaced 
at 90° around the pupil were manually identified, and the network was 
trained with default parameters. Then, around 100 more frames were 
manually annotated focusing on frames with errors, and the network 
re-trained. The pupil area was taken to be the area of an ellipse with 
major and minor axis lengths given by the distances between oppo- 
site pairs of detected points. Some recordings were excluded from 
these analyses owing to video quality that was unusable for sufficiently 
accurate measurement of the pupil (n=5 out of 39), primarily owing to 
obscured pupils due to eyelashes or eyelids. 
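
The ellipse-area step can be written out as a short Python sketch. The function name and the (x, y) tuple format for the four tracked points are assumptions for illustration; the tracking itself (DeepLabCut) is not reproduced here.

```python
import math

def pupil_area(top, right, bottom, left):
    """Area of an ellipse whose two axis lengths are the distances
    between opposite pairs of tracked pupil points (each an (x, y) tuple)."""
    axis_1 = math.dist(top, bottom)       # full length of one axis
    axis_2 = math.dist(left, right)       # full length of the other axis
    return math.pi * (axis_1 / 2) * (axis_2 / 2)
```

For example, points at (0, 2), (3, 0), (0, −2) and (−3, 0) give axis lengths of 4 and 6 pixels and an area of 6π square pixels.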

We measured video motion energy (Extended Data Fig. 9h, i, k) 
from the low-zoom videos of the frontal aspect of the subjects, which 
included the face, arms, and part of the torso of the mice. As nothing 
in the frame except the mouse could move, we calculated total motion
energy of the pixels of the video as an index of overt movements of the 
mouse. This was computed as: 


$$\sum_{\text{pixels}} \left| i_t - i_{t-1} \right|$$

where i_t is the intensity of the pixel on frame t.
Both pupil area and video motion energy were z-scored before gen- 
eralized linear model (GLM) fitting. 
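
A minimal NumPy sketch of the motion-energy measure (the sum over pixels of absolute frame-to-frame intensity differences) and of the z-scoring step follows. The function names and the (frames, height, width) array layout are assumptions for illustration.

```python
import numpy as np

def motion_energy(frames):
    """Sum over pixels of |i_t - i_(t-1)| for each consecutive frame pair.

    frames: array of shape (n_frames, height, width); returns n_frames - 1 values.
    """
    frames = np.asarray(frames, dtype=float)   # avoids unsigned-integer wrap-around
    return np.abs(np.diff(frames, axis=0)).sum(axis=(1, 2))

def zscore(x):
    """Subtract the mean and divide by the (population) standard deviation."""
    x = np.asarray(x, dtype=float)
    return (x - x.mean()) / x.std()
```

Casting to floating point before differencing matters for 8-bit video, where a negative difference would otherwise wrap around to a large positive value.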


GLM prediction of P(Go) from pre-stimulus variables 

To determine the impact of arousal-, reward- and history-related variables
on the ability to predict whether the upcoming trial’s outcome
would be a Go response (that is, a left or right choice), and whether
these factors could account for the relationship between Engagement 
Index and Go/Miss trials, we fit a GLM model to several measures of 
overt behaviour (Extended Data Fig. 9). These behavioural variables 
were: inter-trial interval; previous trial reward outcome (coded as 0
or 1); pupil area in the pre-stimulus window (z-scored); and motion
energy in the pre-stimulus window (z-scored). A GLM with binomial
link function was fit to these variables (Matlab function ‘fitglm’)
to predict whether the following trial had a Go or NoGo outcome.
Squared terms were included for pupil area and video motion energy
after it was observed that the empirical relationship between each of
these and the P(Go) had an inverted U-shape⁵³. Trials were selected
for the model fitting to have matched contrast between these two 
types (see Engagement index above). A deviance test was used to 
compare this model with a model that additionally had engagement 
index included as a predictor. A significant value of this test does 
not indicate that the engagement index suffices to predict P(Go), 
with other variables making no further contribution; rather, it indi- 
cates that the other variables did not fully predict P(Go), and that 
engagement index can improve this prediction. The population vec- 
tor analysis in Extended Data Fig. 9k further argues that engagement 
index relates to P(Go) more closely than does the population vector 
of these other variables. 


Combined-conditions choice probability and detect probability 
analysis 
Choice probability is a non-parametric measure of the difference in firing
rate between trials with identical stimulus conditions but different
choices. Typically, it is calculated separately for each stimulus condi- 
tion, but here we used an algorithm that combines observations across 
stimulus conditions into one number, allowing it to be calculated even 
for our small number of trials per condition (since there are 16 stimulus 
conditions, and trials with NoGo responses are excluded for the choice 
probability calculation). Choice probability is classically calculated as 
the area under a receiver operating characteristic (ROC) curve, which 
is equivalent to a Mann-Whitney U statistic, that is, to the probability 
that a firing rate observation from the trials with one choice is greater 
than that from trials with the other choice. Accordingly, this can be 
calculated by comparing each trial of one condition to each of the other 
condition, counting the number of such comparisons for which the first 
condition wins, and dividing by the total number of comparisons. To 
extend the method to a situation of many stimulus conditions but few 
trials of each condition, we add the numerators and denominators of 
this ratio across all conditions, and then divide. In this way, the core
logic of choice probability (that trials of one choice are only compared
to trials of the other choice under identical stimulus conditions) is preserved,
but all stimulus conditions can be combined into one number
per neuron. This number quantifies the same thing as the classic choice
probability, namely, the probability that the spike count of a neuron will
be greater for trials of one choice than another, given matched stimulus 
conditions. To estimate statistical significance, we used a shuffle test 
in which trial labels (as left or right choice) were randomly permuted 
within each stimulus condition 2,000 times, and the choice probability 
was computed for each shuffle. Because this algorithm combines trials 
from multiple stimulus conditions into a single statistic, we refer to it 
as the combined-conditions choice probability (ccCP). 
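
The pooling of numerators and denominators across stimulus conditions, and the within-condition shuffle test, can be sketched in NumPy as below. This is an illustrative sketch, not the authors' implementation: the function names are hypothetical, ties are counted as half a win (an assumption; the text does not specify tie handling), and the two-sided definition of the P value is likewise an assumption.

```python
import numpy as np

def cc_choice_probability(rates, choice, condition):
    """Combined-conditions choice probability for one neuron.

    rates: spike counts per trial; choice: boolean per trial (True = first choice);
    condition: stimulus-condition label per trial.
    """
    rates = np.asarray(rates, dtype=float)
    choice = np.asarray(choice, dtype=bool)
    condition = np.asarray(condition)
    wins, comparisons = 0.0, 0
    for c in np.unique(condition):
        a = rates[(condition == c) & choice]
        b = rates[(condition == c) & ~choice]
        if a.size == 0 or b.size == 0:
            continue                       # only one choice made: cannot contribute
        diff = a[:, None] - b[None, :]     # every a-trial against every b-trial
        wins += (diff > 0).sum() + 0.5 * (diff == 0).sum()
        comparisons += diff.size
    return wins / comparisons if comparisons else float("nan")

def shuffle_p_value(rates, choice, condition, n_shuffles=2000, seed=0):
    """Permute choice labels within each stimulus condition and recompute ccCP."""
    rng = np.random.default_rng(seed)
    choice = np.asarray(choice, dtype=bool)
    condition = np.asarray(condition)
    observed = cc_choice_probability(rates, choice, condition)
    null = np.empty(n_shuffles)
    for k in range(n_shuffles):
        shuffled = choice.copy()
        for c in np.unique(condition):
            idx = np.flatnonzero(condition == c)
            shuffled[idx] = rng.permutation(choice[idx])
        null[k] = cc_choice_probability(rates, shuffled, condition)
    # fraction of shuffles at least as far from 0.5 as the observed value
    p = np.mean(np.abs(null - 0.5) >= abs(observed - 0.5))
    return observed, p
```

The same two functions give a detect probability if `choice` is replaced by a boolean marking trials with any movement versus NoGo trials.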

We used the same algorithm to compute a detect probability (DP) com- 
paring trials with NoGo outcomes to those with either left or right choices. 

Importantly, not all trials can be included in the ccCP analysis. Specifi- 
cally, the trials from any stimulus condition in which the subject only 
made left or only made right choices cannot contribute to the ccCP. 
This is unlike the kernel regression analysis, in which those trials would 
still contribute to the estimates of the vision and choice kernels (and 
note that the kernel analysis additionally makes use of reaction time 
variability to separate these representations). As a result, six sessions 
had to be excluded for having fewer than ten trials include-able in the 
ccCP analysis. Notably, two of these six sessions contained some of 
the MOs neurons with significant choice representations under the 
kernel analysis, so that 20.8% of MOs choice-selective neurons were 
not included here. 


Focality index 
To statistically test the degree to which neurons encoding different task 
variables were localized, we used a focality index, defined as:

$$\frac{\sum_a p_a^2}{\left(\sum_a p_a\right)^2}$$

where p_a is the proportion of neurons in area a selective for the task
correlate of interest (action, contralateral vision or choice), as assessed 
by reduced-rank regression analysis (Fig. 3c, e, 4b). This measure is an 
adaptation of the sparseness measure of Treves and Rolls⁵⁴, and would
take the value 1 if all neurons were located in a single region, and the
value 1/N if neurons were equally probable in N regions. Ninety-five per
cent confidence intervals were computed using the bootstrap, with a
normal approximated interval with bootstrapped bias and standard
error (function bootci in Matlab). 
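
A short Python sketch of the focality index, using the ratio of the sum of squared proportions to the squared sum of proportions, shows its two limiting values. The function name is hypothetical, and the bootstrap confidence intervals are not reproduced here.

```python
def focality_index(proportions):
    """Sum of squared per-area proportions divided by the squared sum."""
    total = sum(proportions)
    return sum(p * p for p in proportions) / (total * total)
```

With all selective neurons in one area the index is 1; with equal proportions in 42 areas it is 1/42, approximately 0.0238.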


Anatomical targeting 

To select probe insertion trajectories, we first identified desired recording
sites, and then designed appropriate trajectories to reach them using
the allen_ccf_npx GUI (A.J. Peters, www.github.com/cortex-lab/allenCCF).
In doing so, Allen CCF coordinate (5.4 mm AP, 0 DV, 5.7 LR) was taken as 
the location of bregma. Craniotomies were targeted accordingly and 
angles of insertion were set manually. For some of the visual cortex 
recordings, surface insertion coordinates were targeted based on prior 
widefield calcium imaging. Widefield imaging used techniques described
previously“, during presentation of the sparse visual noise receptive field 
mapping stimulus described above. Responses to visual stimuli near the 
intended location of the task stimuli were combined and used to identify 
cortical locations with retinotopically aligned neurons. In some cases, 
these same imaging sessions were used to target MOp and SSp record- 
ings to the area of large activity observed during forelimb movements 
that covers both of those areas“. Finally, MOs recordings were targeted 
at and around the cortical coordinates identified as disrupting task per- 
formance when inactivated, around +2 mm AP, 1mm ML*. 


Histological probe localization 

Recording sites were localized to brain regions by manual inspection 
of histologically identified recording tracks, in combination with alignment
to the Allen Institute Common Coordinate Framework, as follows.

Mice were perfused with 4% paraformaldehyde, the brain was 
extracted and fixed for 24 h at 4 °C in paraformaldehyde, then transferred
to 30% sucrose in PBS at 4 °C. The brain was mounted on a
microtome in dry ice and sectioned at 60-µm slice thickness. Sections
were washed in PBS, mounted on glass adhesion slides, and stained with
DAPI (Vector Laboratories, H-1500). Images were taken at 4× magnification
for each section using a Zeiss AxioScan, in three colours: blue for
DAPI, green for GCaMP (when present), and red for DiI.

An individual DiI track was typically visible across multiple slices,
and recording locations along the track were manually identified by 
comparing structural aspects of the histological slice with features in 
the atlas. In most cases, this identification was aided by reconstruc- 
tion of the track in Allen CCF coordinates. To achieve this, we used the 
following procedure. 

First, we manually identified the 3D locations within the Allen Common
Coordinate Framework of each observed DiI spot on each slice
(15.1 ± 6.9 such spots per probe, Extended Data Fig. 2a). Code for doing
this is provided open-source (https://github.com/cortex-lab/allenCCF;
in particular, ‘sharp-track’⁵⁵). There was no ambiguity about which dye
stain corresponded to which penetration, as penetrations were not 
repeated within a subject. 

After identifying 3D points along a probe penetration, we fit a line 
to those points, which represents an estimate of the probe trajectory 
based on all DiI spots (Extended Data Fig. 2b). We quantified the lateral
localization error by the median distance of the DiI spots from this
common trajectory (39.3 µm) as a quantitative estimate of the error
of step no. 1. Though this error is already small, the fitting of a line to
many points presumably further reduces the error, resulting in a reliable
estimate of the probe’s vector through the brain. Moreover, the
predicted set of brain regions that this vector passes through can be 
directly verified by histological inspection (Extended Data Fig. 3). On 
completion of this step, the method therefore provides an estimate of 
the probe trajectory through the 3D atlas, but does not yet provide a 
mapping from each recording site to a location along this trajectory. 
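
Fitting a line to 3D points and measuring each point's distance from it can be sketched with a singular value decomposition, which gives the total least squares fit. This is an illustrative NumPy sketch with hypothetical names, not the sharp-track code itself.

```python
import numpy as np

def fit_track(points):
    """Total least squares line through 3D points.

    points: array of shape (n_points, 3) in atlas coordinates.
    Returns the centroid, a unit direction vector, and each point's
    perpendicular distance from the fitted line.
    """
    points = np.asarray(points, dtype=float)
    centroid = points.mean(axis=0)
    centered = points - centroid
    # the first right singular vector is the direction of greatest variance
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    direction = vt[0]
    along = centered @ direction                       # position along the line
    perpendicular = centered - np.outer(along, direction)
    distances = np.linalg.norm(perpendicular, axis=1)
    return centroid, direction, distances
```

The lateral error estimate described above would then correspond to `np.median(distances)` for each probe.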

Next, we found the longitudinal mapping from recording sites to 
the probe trajectory. While the tip location provides one number that 
can identify the offset, it cannot estimate scaling that may vary owing 
to shrinkage. We therefore adopted an alternative approach that uses 
multiple electrophysiological landmarks to estimate scaling and depth 
on a brain-by-brain basis. These landmarks consist of thin structures
such as the CA1 pyramidal layer, or white matter boundaries, which are
unambiguously identifiable in spike rasters. Behavioural task correlates
and visual receptive fields were not considered during this alignment 
procedure. We found the offset and scaling of a linear relationship 
between recording site number and distance along the probe trajectory 
by linear regression of these multiple landmarks. We show an example 
of this process (Extended Data Fig. 2c–e), in which six landmarks are
used to find the appropriate depth and scaling (1.01 in this case, and
across all recordings 1.08 ± 0.02 (mean ± s.d.)). We found the shrinkage
factor of our fixed tissue relative to the atlas to be 8% on average, but
estimated it on a brain-by-brain basis when aligning electrode tracks.

Finally, we quantitatively estimated the accuracy of this longitudi- 
nal alignment procedure by taking advantage of the fact there were 
more points constraining the alignment than there are parameters. 
To estimate alignment errors, we used a cross-validation approach: 
we fit the longitudinal mapping using all landmarks except one, and 
estimated how far the predicted location of this held-out landmark 
is from its true location (Extended Data Fig. 2f). We found that the 
median absolute deviation of these errors was 88.7 µm, comparable to
the width of the probe shank itself (70 µm), that is, near the resolution
limits of the method. 
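
The longitudinal mapping and its leave-one-out check can be illustrated as follows. This is a minimal sketch under stated assumptions: the function names are hypothetical, landmarks are given as paired arrays of recording-site number and distance along the fitted trajectory, and an ordinary least squares line (via `np.polyfit`) stands in for the linear regression described in the text.

```python
import numpy as np

def fit_depth_mapping(site, distance):
    """Linear map from recording-site number to distance along the track.

    Returns (scale, offset) such that distance is approximately scale * site + offset.
    """
    scale, offset = np.polyfit(site, distance, deg=1)
    return scale, offset

def leave_one_out_errors(site, distance):
    """Refit without each landmark in turn and predict the held-out landmark."""
    site = np.asarray(site, dtype=float)
    distance = np.asarray(distance, dtype=float)
    errors = np.empty(site.size)
    for i in range(site.size):
        keep = np.arange(site.size) != i
        scale, offset = np.polyfit(site[keep], distance[keep], deg=1)
        errors[i] = scale * site[i] + offset - distance[i]   # predicted minus true
    return errors
```

At least three landmarks are needed so that two remain for each held-out fit; a summary such as the median of the absolute errors can then be reported per recording or pooled across recordings.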


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 

The behavioural and neural datasets generated and analysed in this 
study are available as downloadable files at https://figshare.com/articles/steinmetz/9598406
and via the Open Neurophysiology Environment
interface at https://figshare.com/articles/steinmetz/9974357.


Code availability 


The code used to analyse the data is available at https://github.com/nsteinme/steinmetz-et-al-2019.


44. Steinmetz, N. A. et al. Aberrant cortical activity in multiple GCaMP6-expressing 
transgenic mouse lines. eNeuro 4, 0207-17.2017 (2017). 

45. Guo, Z. V. et al. Flow of cortical activity underlying a tactile decision in mice. Neuron 81, 
179-194 (2014). 

46. Bhagat, J. et al. Rigbox: an open-source toolbox for probing neurons and behavior.
Preprint at https://www.biorxiv.org/content/10.1101/672204v3 (2019).

47. Ludwig, K. A. et al. Using a common average reference to improve cortical neuron
recordings from microelectrode arrays. J. Neurophysiol. 101, 1679-1689 (2009).

48. Park, I. M., Meister, M. L. R., Huk, A. C. & Pillow, J. W. Encoding and decoding in parietal
cortex during sensorimotor decision-making. Nat. Neurosci. 17, 1395-1403 (2014).

49. Ashe, J. & Georgopoulos, A. P. Movement parameters and neural activity in motor cortex
and area 5. Cereb. Cortex 4, 590-600 (1994).

50. Paninski, L., Shoham, S., Fellows, M. R., Hatsopoulos, N. G. & Donoghue, J. P. Superlinear
population encoding of dynamic hand trajectory in primary motor cortex. J. Neurosci. 24,
8551-8561 (2004).

51. Izenman, A. J. Reduced-rank regression for the multivariate linear model. J. Multivariate 
Anal. 5, 248-264 (1975). 

52. Mathis, A. et al. DeepLabCut: markerless pose estimation of user-defined body parts with 
deep learning. Nat. Neurosci. 21, 1281-1289 (2018). 

53. McGinley, M. J., David, S. V. & McCormick, D. A. Cortical membrane potential signature of 
optimal states for sensory signal detection. Neuron 87, 179-192 (2015). 

54. Treves, A. & Rolls, E. T. What determines the capacity of autoassociative memories in the 
brain? Netw. Comput. Neural Syst. 2, 371-397 (1991). 

55. Shamash, P., Carandini, M., Harris, K. D. & Steinmetz, N. A. A tool for analyzing electrode 
tracks from slice histology. Preprint at bioRxiv https://doi.org/10.1101/447995 (2018). 


Acknowledgements We thank A. J. Peters, M. Pachitariu and P. Shamash for software tools; H. 
Forrest for help with data preprocessing; C. Reddy, M. Wells and L. Funnell for help with mouse 
husbandry, training and histology; R. Raghupathy for help with histology; C. P. Burgess for help 
with experimental apparatus; T. Harris and B. Karsh for support with Neuropixels recordings; M.
Häusser, A. J. Peters and S. Schroeder for feedback on the manuscript. This project was funded
by the European Union’s Marie Skłodowska-Curie program (656528), the Human Frontier
Sciences Program (LT001071/2015-L), the Wellcome Trust (205093, 102264), the European
Research Council (694401), the Gatsby Foundation (GAT3531) and the Simons Foundation 
(325512). M.C. holds the GlaxoSmithKline / Fight for Sight Chair in Visual Neuroscience. 


Author contributions N.A.S., M.C. and K.D.H. conceived and designed the study. N.A.S. 
collected data. N.A.S., P.Z.-H. and K.D.H. analysed data. N.A.S., M.C. and K.D.H. wrote the 
manuscript. 


Competing interests The authors declare no competing interests. 


Additional information 

Supplementary information is available for this paper at https://doi.org/10.1038/s41586-019-1787-x.

Correspondence and requests for materials should be addressed to N.A.S. 

Reprints and permissions information is available at http://www.nature.com/reprints. 


[Extended Data Fig. 1, panel titles recovered from the figure: a, Cori (526 trials, 3 sessions); b, Forssmann (904 trials, 4 sessions); c, Hench (1175 trials, 4 sessions); d, Lederberg (1770 trials, 7 sessions); e, Muller (616 trials, 3 sessions); f, Radnitz (764 trials, 5 sessions); g, Richards (880 trials, 5 sessions); h, Tatum (813 trials, 4 sessions); i, Moniz (497 trials, 3 sessions); j, Theiler (269 trials, 1 session); k, Combined subjects (pedestal 0%); l, Performance at 100% contrast (chance = 33%); m, Example segment of rotary encoder data; o, p, Examples aligned to movement onset. Axes: ΔContrast (%), Time (s), Time from detected onset (s).]

Extended Data Fig. 1 | Behavioural performance as psychometric curves for
each subject, and analysis of wheel movements. a, Psychometric curves for
mouse Cori, showing the probability of choosing left (blue), right (orange) or
NoGo (black) as a function of stimulus contrasts on the left and right screens.
Each row corresponds to a pedestal contrast (the minimum contrast on the left
and right screens). The horizontal axis encodes the relative contrast from the
pedestal value, positive numbers indicating higher contrast on the right
screen, and negative numbers for higher contrast on the left screen (for
example, at pedestal = 50%, a ΔContrast of +50% corresponds to trials with 50%
contrast on the left screen and 100% contrast on the right screen). Dots and
vertical lines indicate the empirical fraction of choices made and 95% binomial
confidence intervals for the fraction estimate, pooling data over sessions.
Curves indicate the fit of a multinomial logistic model: $\ln(p_L/p_{NoGo}) = b_L + s_L c_L^n$ and
$\ln(p_R/p_{NoGo}) = b_R + s_R c_R^n$, in which $c_L$ and $c_R$ are the contrasts on the left and right, and
parameters $b_L$, $s_L$, $n$, $b_R$ and $s_R$ are fit by maximum likelihood estimation to the
data for each subject. b–j, As in a, for the remaining subjects. k, The model fit
for all subjects overlaid, for left choices (blue) and right choices (orange), in
both cases for pedestal = 0%. l, Summary of performance on high-contrast
trials. Dots reflect the session-pooled proportion correct of each mouse for 
trials with 100% versus 0% contrast, with 95% binomial confidence interval. m, 
Example segment of wheel position data showing wheel movements detected 
as left turns (blue), right turns (orange) or incidental movements (black). 
Detected onsets (green circles) and offsets (red circles) marked for each 
movement. y-axis scale: distance moved at the circumference of the wheel (that 
is, 216, in which R is wheel radius and θ is its angular position). n, Wheel
velocity trace for the same segment of data as in a. o, Example wheel turns
aligned to the detected onset time. The dashed box indicates the region 
expanded in p. p, Example wheel turns aligned to detected onset time, zoomed 
to show the moment of takeoff, illustrating that the wheel had moved by less 
than 0.5 mm by onset. The step-like appearance of the trace reflects the
resolution of the rotary encoder (each step unit is 0.135 mm at the surface of 
the wheel). q, Decoding the eventual direction of the wheel movement using 
the instantaneous velocity at different times relative to detected movement 
onset reveals that the direction only starts to be decodable around 20 ms 
before detected onset, and is not reliably (80%) decoded until the time of 
onset. Error bars represent s.d. across sessions (n = 39).

Extended Data Fig. 2 | Method for histological alignment. a, Before insertion,
probes were coated with DiI. The brain was sliced and imaged, and locations of
each probe’s DiI spots were manually identified on the Allen CCF atlas (15.1 ± 6.9
spots per probe). When multiple penetrations were performed in a single brain,
their tracks were sufficiently far apart to avoid confusion. b, A vector is fit to the
probe track using total least squares linear regression. The median distance of
individual points from this vector is 39.3 µm, providing an estimate of lateral
displacement error. c, To fit the longitudinal mapping from recording sites to
brain locations, we used landmarks that were easily detectable by their
electrophysiological signatures (arrows, left), linearly interpolating the
location of sites between these landmarks. d, Visual receptive fields served as a
post hoc check on correct alignment, but were not used to estimate track
location. Each horizontally elongated plot with two vertical black lines
indicates the responsiveness of all spikes recorded in an 80-µm-depth bin to
flashed white squares at varying locations on the three screens (see Methods, 
Receptive field mapping). Colour map indicates spike rate, independently 
scaled for each map. e, Areas assigned for each recording site. Right, example 
DiI traces in slices corresponding to these locations. f, Example of cross-validation
procedure to assess error in longitudinal alignment. For each point,
the longitudinal mapping was recomputed excluding this point, and the 
distance from this point to the mapping fit to other points provides an estimate 
of longitudinal alignment error. Brain diagrams were derived from the Allen
Mouse Brain Common Coordinate Framework (v.3 (2017); downloaded from
http://download.alleninstitute.org/informatics-archive/current-release/mouse_ccf/).

Extended Data Fig. 3 | Examples of DiI tracks showing recording sites from
the depicted sub-surface brain regions in aligned histology. Visual
inspection of the DiI tracks confirms that the probe indeed passed through
that region at some point along the recording span. Thick white lines outline
the region given in the plot title; grey lines outline other regions; and white
arrows point to places where probe tracks were found within the given region.
Blue: DAPI; green: GCaMP; red: DiI. Brain diagrams were derived from the Allen
Mouse Brain Common Coordinate Framework (v.3 (2017); downloaded from
http://download.alleninstitute.org/informatics-archive/current-release/mouse_ccf/).

Extended Data Fig. 4 | Global neuronal activity and distribution. a, Activity
of example neurons in VISp and VISam, showing the neuron’s waveform and 
anatomical location (top), rasters sorted by contralateral contrast (middle), 
and trial-averaged firing rates (smoothed with 30 ms causal half-Gaussian) for 
each of the four contralateral contrasts (bottom). Shaded regions show the 
s.e.m. across trials. b, Colour map showing trial-averaged firing rates of all 
highly activated neurons (P<10™* compared with pre-trial activity), vertically 
sorted by firing latency. Latency sorting was cross-validated: latencies for each 
neuron were determined from odd-numbered trials, and activity from even- 
numbered trials is depicted in the plot. The grey scale represents average 
normalized firing rate across even-numbered trials with contralateral visual 
stimuli and contralateral choice. c–e, Curves showing mean firing rate across
responsive neurons in each area, aligned to visual stimulus onset (c),
movement onset (d) or reward onset (e). Trials were included in e when a reward
was earned for keeping the wheel still following zero-contrast stimuli. Shaded
regions show the s.e.m. across neurons. f, The focality index, defined as
$\sum_a p_a^2 / \left(\sum_a p_a\right)^2$, in which $p_a$ is the proportion of neurons in area a selective for the
kernel in question, measures how widely versus focally distributed a
representation is, with a floor of 0.0238 for a uniform distribution (across 42
brain regions) and a maximum of 1.0 if all selective neurons were found in a
single brain region. This focality index was 0.079 for Choice, 0.069 for Vision
single brain region. This focality index was 0.079 for Choice, 0.069 for Vision 
kernels and 0.040 for Action kernels; the differences between Choice and 
Action, as well as contralateral Vision and Action, were statistically significant 
(P< 0.05; bias-corrected bootstrap). Dots represent the true value and error 
bars represent bias-corrected bootstrap-estimated 95% confidence intervals. 
Brain diagrams were derived from the Allen Mouse Brain Common Coordinate 
Framework (v.3 (2017); downloaded from http://download.alleninstitute.org/ 
informatics-archive/current-release/mouse_ccf/). 
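
The focality index defined in f can be illustrated with a short Python sketch. The function name and the example proportions are illustrative assumptions, not values from the paper; the sketch only reproduces the stated floor (1/42 for a uniform distribution over 42 regions) and maximum (1.0 when a single region holds all selective neurons).

```python
def focality_index(proportions):
    """Return sum(p_a^2) / (sum(p_a))^2 for per-area proportions p_a."""
    total = sum(proportions)
    if total == 0:
        raise ValueError("at least one area must contain selective neurons")
    return sum(p * p for p in proportions) / (total * total)


# Uniform distribution across 42 regions: the index reaches its floor of 1/42.
uniform = focality_index([0.1] * 42)

# All selective neurons in a single region: the index reaches its maximum of 1.
focal = focality_index([0.3] + [0.0] * 41)

print(round(uniform, 4), focal)
```

Because the index is normalized by the squared sum, it depends only on how selectivity is spread across areas, not on the overall fraction of selective neurons.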


Extended Data Fig. 5| Comparison of the reduced-rank kernel regression 
method to other methods for spike train prediction. a, Example fit of spiking 
data for an individual neuron with the kernel model. Green trace shows spike 
data smoothed with a causal filter, black shows the model’s prediction, and 
other coloured traces show the components of the prediction from each 
kernel. Data between trials is omitted from the fitting and from this plot. 

b, Cartoon of the three methods evaluated. In the Toeplitz and Cosine models, a 
predictor matrix X of size $N_{\mathrm{timepoints}} \times N_{\mathrm{predictors}}$ is constructed from task events 
(illustrated, transposed, in d). A linear fit from predictors X to spike counts Y is 
estimated using elastic net regularization. In the reduced-rank regression 
method, the predictor matrix X is the same as the Toeplitz model, but predicts 
Y after passing through a low-rank bottleneck (X·b), which is optimized using 
reduced-rank regression. c, Relationship between Action and Choice kernels, 
which are added or subtracted together to give the shapes for left and right 
choice trials. This allows separation of neurons with choice from action 
correlates, while still allowing for arbitrary-shaped responses on left and right 
trials. d, Structure of predictor matrices (shown transposed). The Toeplitz 
predictor has rows for each variable and time offset, which take non-zero values 
for time points (columns) corresponding to the appropriate time offset from 
the given event. The cosine model has similar structure but with rows replaced 
by smooth raised cosine functions, allowing a smaller number of basis 
functions. The reduced-rank regression model has learned a small number of 
dense basis functions optimized to predict spike counts. e, Density scatterplot 
of cross-validated variance explained for each neuron under the Toeplitz 
model against the reduced-rank model (top), and for the cosine model versus 
the reduced-rank model (bottom). Each point represents one cell, coloured to 
show density when they overlap. Right, magnified view of the densest region of 
the plot. These comparisons show that the reduced-rank model consistently 
outperforms the other two (points lie below the diagonal), and that it overfits 
fewer neurons (fewer points with CV variance explained <0). f, The proportion 
of overfit-explained variance, that is, $(CV_{\mathrm{train}} - CV_{\mathrm{test}})/CV_{\mathrm{train}}$, in which $CV_{\mathrm{train}}$ is 
the train-set variance explained and $CV_{\mathrm{test}}$ is the test-set variance explained. 
Smaller values for the reduced-rank model show that it overfits less. g, Left, 
population decoding of contralateral visual stimulus contrast from residual 
population activity in each area after subtracting the prediction of a model 
including all other kernels. The other panels depict the same analysis for 
decoding of ipsilateral visual stimulus contrast, action and direction of choice. 
h, Distributions of the cross-validated proportion variance explained for each 
neuron when shuffling left and right trial choice labels (orange) together with 
the distribution for the original data (blue). After shuffling, a small number (14, 
0.33%) of neurons are false positives by this threshold. The dashed line 
represents the 2% CV variance explained threshold used. The y axis is clipped. 
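
The Toeplitz predictor matrix (b, d), the low-rank bottleneck (b) and the overfit proportion (f) can be sketched in Python with NumPy. This is a minimal sketch under stated assumptions: it uses the textbook closed-form reduced-rank solution (least-squares fit projected onto its leading singular vectors), omits the elastic net regularization mentioned for the other models, and runs on synthetic data. All function names and parameter values are hypothetical and are not the authors' implementation.

```python
import numpy as np


def toeplitz_predictor(event_times, n_timepoints, n_lags):
    """Build an (n_timepoints x n_lags) matrix with a 1 at row t, column k
    whenever an event occurred k time bins before t."""
    X = np.zeros((n_timepoints, n_lags))
    for t0 in event_times:
        for k in range(n_lags):
            if t0 + k < n_timepoints:
                X[t0 + k, k] = 1.0
    return X


def reduced_rank_regression(X, Y, rank):
    """Textbook reduced-rank regression: project the least-squares fit
    onto its top `rank` right singular vectors."""
    B_ols, *_ = np.linalg.lstsq(X, Y, rcond=None)
    _, _, Vt = np.linalg.svd(X @ B_ols, full_matrices=False)
    V_r = Vt[:rank].T                  # (n_neurons x rank)
    return B_ols @ V_r @ V_r.T         # (n_predictors x n_neurons), rank <= `rank`


def overfit_fraction(cv_train, cv_test):
    """Proportion of overfit-explained variance, (CV_train - CV_test) / CV_train."""
    return (cv_train - cv_test) / cv_train


rng = np.random.default_rng(0)
X = toeplitz_predictor(event_times=[5, 40, 80], n_timepoints=120, n_lags=10)

# Synthetic population (assumption): 8 neurons sharing 2 temporal kernels, plus noise.
true_B = rng.normal(size=(10, 2)) @ rng.normal(size=(2, 8))
Y = X @ true_B + 0.1 * rng.normal(size=(120, 8))

B_rrr = reduced_rank_regression(X, Y, rank=2)
```

The bottleneck forces all neurons to share a small set of temporal basis functions, which is why such a model has fewer free parameters per neuron than a full Toeplitz fit.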


Extended Data Fig. 6 | Summary of variance explained by the kernel model 
and population average responses on Go, Miss, and Passive trials. a, The 
unique contribution of each predictor variable as assessed by nested 
prediction. Each panel depicts the distribution of variance explained across 
neurons of a single brain region, using various reduced-rank kernel regression 
models (compare with Figs. 3c, e, 4b). Each bar shows the 10th, 25th, 50th, 75th 
and 90th percentiles of the distribution for a single prediction model, colour- 
coded by model identity. The numbers in the subplot title indicate the number 
of all neurons analysed with the full model (that is, the distribution shown with 
the grey bar), and the number of neurons included for nested model analysis 
(that is, cells with >2% variance explained with the full model). The black bar 
shows distribution of variance explained by the full model in this subset; 
coloured bars show the unique contribution of each predictor. Note that the 
unique contributions need not sum to the variance of the full model, as 
predictor variables are correlated. Variance explained by the Action kernel 
(yellow) is essentially global, whereas contralateral Vision variance explained is 
distinctly restricted, and Choice is rare enough to be difficult to see in these 
plots. b, Population average firing rates across neurons for each brain region in 
Go, Miss, and passive trials, selected to have matched contralateral visual 
stimulus contrasts. The patterns characteristic of engagement can be seen in 
pre-stimulus activity (that is, before time 0): the pre-stimulus firing rate of 
midbrain, basal ganglia, and hippocampal regions is in the order Go > Miss > 
passive; whereas pre-stimulus activity in neocortical areas instead is arranged 
as passive > Miss and Go. Thalamic regions can exhibit either pattern; notably, 
visual thalamic regions (LGd, LP and LD) follow the pattern of neocortical areas. 


Extended Data Fig. 7| Choice probability and detect probability analysis. a, 
The percentage of neurons with significant ccCP (that is, neurons whose rate 
differed significantly between left and right choices in response to the same 
stimulus; left two columns) and ccDP (that is, neurons whose rate differed 
significantly between Go and NoGo trials in response to the same stimulus; 
right two columns) as a function of time aligned to visual stimulus onset (left) 
and movement onset (right). The horizontal dashed line represents the value 
expected by chance given the statistical threshold alpha = 0.05. b, Percentage 
of neurons in each area with significant detect probability between -50 and 
+50 ms from movement onset, replicating the finding from Figs. 2d, e and 3e that 
non-selective action signals are distributed widely. Asterisks indicate brain 
regions for which 95% confidence intervals for the percentage of significant 
neurons (black error bars) did not include the chance value (5%, horizontal 
dashed line). c, As in b, for choice probability in the same window. The number 
of trials usable in this analysis is limited, meaning that some sessions (n= 6 of 
39) had to be excluded; nevertheless, this analysis broadly replicates the 
finding from Fig. 4b that around the time of movement onset, choice-selective 
neurons are restricted to frontal cortex, basal ganglia, midbrain and certain 
thalamic nuclei. d, As in a, for choice probability in a window +150 to 250 ms 
after movement onset, showing that by this time choice-related signals are 
distributed more widely, including visual and parietal cortex. These signals are 
too late to have participated in generating the choice but could reflect either 
corollary discharge or sensory reafference. However, they cannot reflect 
movement of the visual stimulus on the screen, as it is fixed during this time 
period. e, The percentage of neurons with significant choice probability, as a 
function of time relative to movement onset for selected areas (zoom and 
overlay of certain traces from a), replicating that choice-related activity is first 
seen in the final 50-100 ms relative to movement onset and with similar timing 
across multiple areas. Note that six sessions were excluded from ccCP analysis 
for having too few trials; these six sessions included 20.8% of the MOs neurons 
determined to have choice-selective responses with kernel regression. f, The 
pre-stimulus detect probability (after subtracting 0.5, so that positive values 
indicate higher rates on Go trials, and negative values the reverse) versus the 
mean Go - Miss firing rate difference for each area (used in Fig. 5d), 
demonstrating that these two quantities identify essentially the same factor. 
The pre-stimulus detect probability was correlated with the engagement index 
(that is, task — passive difference) similarly to the Go — Miss difference (with 
r=0.48, P=0.001; not shown). 
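
A choice or detect probability of this kind can be read as the area under an ROC curve: the chance that a firing rate drawn from one trial group exceeds a rate drawn from the other, compared only within the same stimulus condition. The Python sketch below is a hedged illustration; it assumes that the "cc" prefix denotes pooling of these pairwise comparisons across stimulus conditions, which the caption does not spell out, and it omits the significance test. Function names and the toy firing rates are hypothetical.

```python
def roc_area(rates_a, rates_b):
    """Probability that a random draw from rates_a exceeds one from rates_b
    (ties count one half); this equals the area under the ROC curve."""
    wins = 0.0
    for a in rates_a:
        for b in rates_b:
            if a > b:
                wins += 1.0
            elif a == b:
                wins += 0.5
    return wins / (len(rates_a) * len(rates_b))


def combined_conditions_probability(trials):
    """Pool pairwise comparisons made within each stimulus condition.

    `trials` maps a stimulus condition to a pair (rates_a, rates_b),
    for example rates on Go trials and rates on NoGo trials."""
    wins, pairs = 0.0, 0
    for rates_a, rates_b in trials.values():
        n = len(rates_a) * len(rates_b)
        if n == 0:
            continue  # a condition lacking one trial type contributes nothing
        wins += roc_area(rates_a, rates_b) * n
        pairs += n
    return wins / pairs


# Toy firing rates (spikes per second) for two stimulus conditions.
example = {"low": ([4, 6], [3, 5]), "high": ([9], [7, 8, 10])}
cp = combined_conditions_probability(example)
print(cp - 0.5)  # positive values indicate higher rates in the first group
```

Subtracting 0.5, as in panel f, centres the measure on zero so that its sign gives the direction of the rate difference.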
Extended Data Fig. 8 | jPECC analysis for determining whether correlations 
occur with a temporal offset between a pair of regions. a, Canonical 
correlation analysis is applied to firing rates at every pair of time points relative 
to a behavioural event (illustration shows 0.1 s after stimulus onset in VISp and 
0.15 s after in MOs). Canonical correlation analysis is applied to the pair of 
matrices containing each cell’s firing rate at selected times on each training set 
trial (90% of the total) to find dimensions in each population maximally 
correlated with each other. (For regularization purposes, this is applied after 
dimensionality reduction using principal component analysis). The strength of 
population correlation is summarized by the correlation of test set activity 
projected onto the first canonical dimension. b, Results on an example session, 
showing relationships between visual cortex, midbrain and frontal cortex 
relative to stimulus onset (top) and movement onset (bottom). Visual cortical 
activity leads frontal and midbrain activity, as can be seen from the below-diagonal 
bias in correlations. However, no lead/lag relationship is seen 
between midbrain and frontal cortex. Grey, P > 0.05. c, Average across all 
recording sessions that contained each pair of areas, showing similar 
relationships to the example in each case. d, e, Summary of lead-lag 
interactions, obtained by subtracting the averages of the jPECC coefficients 
over inter-area time ranges of -50 to 0 and 0 to 50 ms, as a function of time 
relative to stimulus onset (d) or movement onset (e). Grey region, 2 s.e.m. 
across experiments. Visual cortex reliably leads frontal cortex and midbrain at 
around 100 ms after the stimulus; and over a range -200 to -50 ms relative to 
movement. 
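
The procedure in a can be sketched in Python with NumPy: fit canonical correlation analysis on training trials for every pair of time bins, then score the first canonical dimension on held-out trials. This is a simplified illustration on synthetic data, not the authors' code. It omits the principal component step used for regularization, assumes each population matrix has full column rank, and uses hypothetical function names and array shapes.

```python
import numpy as np


def first_canonical_pair(X, Y):
    """Return weights (a, b) maximizing corr(X @ a, Y @ b).
    X, Y: (n_trials x n_cells), assumed to have full column rank."""
    Xc, Yc = X - X.mean(axis=0), Y - Y.mean(axis=0)
    Qx, Rx = np.linalg.qr(Xc)
    Qy, Ry = np.linalg.qr(Yc)
    U, _, Vt = np.linalg.svd(Qx.T @ Qy)
    a = np.linalg.solve(Rx, U[:, 0])
    b = np.linalg.solve(Ry, Vt[0])
    return a, b


def jpecc_matrix(A, B, train, test):
    """A, B: (n_trials x n_times x n_cells) firing rates for two regions.
    Entry (i, j) is the held-out correlation between region A at time i
    and region B at time j along the first canonical dimension."""
    n_times = A.shape[1]
    M = np.zeros((n_times, n_times))
    for i in range(n_times):
        for j in range(n_times):
            a, b = first_canonical_pair(A[train, i], B[train, j])
            M[i, j] = np.corrcoef(A[test, i] @ a, B[test, j] @ b)[0, 1]
    return M


rng = np.random.default_rng(1)
n_trials, n_times = 200, 2
z = rng.normal(size=(n_trials, n_times))       # shared signal, independent per time bin
wa, wb = np.array([1.0, -0.5, 0.8]), np.array([0.7, -1.0])
A = z[:, :, None] * wa + 0.3 * rng.normal(size=(n_trials, n_times, 3))
B = z[:, :, None] * wb + 0.3 * rng.normal(size=(n_trials, n_times, 2))

# 90% of trials for fitting, the remaining 10% for scoring.
train, test = np.arange(0, 180), np.arange(180, 200)
M = jpecc_matrix(A, B, train, test)
```

In this synthetic example the two regions share a signal only at matching time bins, so the large values fall on the diagonal; a consistent lead of one region over the other would instead shift them off the diagonal.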
Extended Data Fig. 9 | Statistical analysis of engagement index and 
influence of alertness-, reward- and history-related variables on pre-stimulus 
firing rates. a, A nested ANOVA with factors of session and subject 
was used to assess statistical significance of pre-stimulus task-passive firing 
rate differences (here normalized, unlike Fig. 5c, d and Extended Data Fig. 6b, 
to meet statistical assumptions) in each brain region (see Methods). All non- 
neocortical regions that showed a significant difference between engaged task 
and passive states had higher mean pre-stimulus firing rates in task context, 
except for visual thalamus. All neocortical regions that showed a significant 
difference between task and passive contexts had lower mean pre-stimulus 
firing rates in the task context. b, An engagement index was computed for each 
task trial by projecting pre-stimulus population activity onto the vector of 
differences between pre-stimulus activity in task and passive contexts. Task 
trials with lower engagement index therefore showed a pattern of pre-stimulus 
population activity more similar to the passive context. Histogram shows the 
distributions of this index over contrast-matched Miss and Go trials; P value 
computed by t-test. c–f, Same plot after restricting to contrast-matched trials 
following rewards (c), after removing the reward effect by partial regression 
(that is, by subtracting the mean within trials of each previous reward 
condition) (d), after restricting to contrast-matched trials following short inter- 
trial intervals (e) or after restricting to contrast-matched trials following long 
inter-trial intervals (f). The effect persists in each case. g, Histogram of pupil 
areas in the pre-stimulus period after previous trials that were rewarded or non-rewarded, 
showing the expected effect of reward on arousal as a positive 
control for the validity of pupil diameter measurements. h, To initiate the next 
trial, subjects must hold the wheel still for 500 ms; video analysis shows that 
they reduce other movements as well. Top, total video motion energy (mean-square 
frame difference) as a function of time relative to stimulus onset, on 
each trial, for an example recording. Bottom, mean motion energy across these 
trials overlaid (red-shaded region represents s.e.m. across trials). Inset, 
example frame from video monitoring the face and forelegs of the mouse. 
i, Results of a logistic GLM predicting the probability of a Go response on the 
subsequent trial from each of the given variables; plot format as in Fig. 5e. The 
null hypothesis that engagement index had no additional effect on Go 
probability over all other variables was rejected using a deviance test 
(P=1.5 10°). Each panel’s curve shows effect of one individual variable on Go 
probability. Red points, mean Go probability averaged over a bin; red error 
bars, 95% confidence interval; black line, fit of GLM, setting all other variables 
to their mean; grey shading, 95% confidence interval. j, Average Go probability 
and GLM fit as a function of engagement index (x axis), and previous trial 
reward (colour). Correlation of P(Go) from engagement index persists despite 
the additional effect of previous trial’s reward. k, To additionally test whether 
engagement index more specifically relates to P(Go) than any other variable, 
we asked whether the engagement vector (that is, the mean difference in pre- 
stimulus population activity between task and passive contexts) matches the 
population vector encoding differences before Go and Miss trials, better than 
it does differences in other behavioural variables. To do so, we computed the 
Pearson correlation between the engagement vector and the Go-Miss vector 
for each recording. This correlation coefficient can be interpreted as the 
cosine of the angle between the two vectors in $N_{\mathrm{neurons}}$-dimensional space. Each 
panel shows a scatter plot, with one dot per recording, comparing this 
correlation against the engagement vector’s correlation with vectors defined 
for each other behavioural variable. In each case we find that the correlation is 
greater for the Go-Miss vector (not reaching significance at alpha = 0.05 level, 
however, for the comparison with the previous reward vector, P=0.096). 
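
The engagement index in b and the vector comparison in k reduce to a few vector operations, sketched here in plain Python. The toy activity values and all function names are hypothetical. Note that a Pearson correlation equals the cosine of the angle between two vectors only after each vector has been mean-centred, which is what the `pearson` helper does.

```python
import math


def mean_vector(trials):
    """Average population activity across trials (one value per neuron)."""
    n = len(trials)
    return [sum(col) / n for col in zip(*trials)]


def engagement_vector(task_trials, passive_trials):
    """Per-neuron difference between mean task and mean passive activity."""
    task, passive = mean_vector(task_trials), mean_vector(passive_trials)
    return [t - p for t, p in zip(task, passive)]


def engagement_index(trial, vector):
    """Projection (dot product) of one trial's activity onto the vector."""
    return sum(x * v for x, v in zip(trial, vector))


def pearson(u, v):
    """Pearson correlation: cosine of the angle between mean-centred vectors."""
    mu, mv = sum(u) / len(u), sum(v) / len(v)
    du, dv = [x - mu for x in u], [y - mv for y in v]
    num = sum(a * b for a, b in zip(du, dv))
    return num / math.sqrt(sum(a * a for a in du) * sum(b * b for b in dv))


# Toy pre-stimulus firing rates: rows are trials, columns are three neurons.
task = [[5, 1, 2], [7, 1, 4]]
passive = [[2, 3, 2], [2, 5, 2]]
vec = engagement_vector(task, passive)          # [4.0, -3.0, 1.0]

task_like = engagement_index([6, 1, 3], vec)    # high index: resembles the task context
passive_like = engagement_index([2, 4, 2], vec) # low index: resembles the passive context
alignment = pearson(vec, [8, -6, 2])            # parallel vectors give a correlation of 1
```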
Extended Data Table 1 | Brain regions recorded 

Abbreviation  Full name                                       N recordings  N mice  N responsive neurons  N total neurons  % responsive
ACA           Anterior cingulate area                         11            8       395                   608              65.0
ACB           Nucleus accumbens                               3             3       152                   255              59.6
APN           Anterior pretectal nucleus                      2             2       177                   235              75.3
BLA           Basolateral amygdalar nucleus                   =             2       107                   261              41.0
CA1           Field CA1                                       21            10      494                   1129             43.8
CA3           Field CA3                                       10            6       243                   438              55.5
CP            Caudoputamen                                    5             5       524                   914              57.3
DG            Dentate gyrus                                   16            10      336                   711              47.3
GPe           Globus pallidus external segment                3             3       146                   274              53.3
ILA           Infralimbic area                                3             3       192                   338              56.8
LD            Lateral dorsal nucleus of the thalamus          6             5       209                   308              67.9
LGd           Dorsal part of the lateral geniculate complex   10            5       397                   811              49.0
LP            Lateral posterior nucleus of the thalamus       11            8       393                   732              53.7
LS            Lateral septal nucleus                          7             5       508                   844              60.2
MD            Mediodorsal nucleus of the thalamus             3             3       244                   381              64.0
MG            Medial geniculate complex of the thalamus       2             2       180                   276              65.2
MOp           Primary motor area                              3             3       553                   682              81.1
MOs           Secondary motor area                            19            9       993                   1534             64.7
MRN           Midbrain reticular nucleus                      11            7       722                   857              84.2
OLF           Olfactory areas                                 9             5       210                   684              30.7
ORB           Orbital area                                    6             5       281                   770              36.5
PAG           Periaqueductal gray                             3             3       72                    130              55.4
PL            Prelimbic area                                  10            7       438                   728              60.2
PO            Posterior complex of the thalamus               5             4       342                   620              55.2
POL           Posterior limiting nucleus of the thalamus      3             3       132                   190              69.5
POST          Postsubiculum                                   4             3       163                   272              59.9
RSP           Retrosplenial area                              9             5       308                   598              51.5
RT            Reticular nucleus of the thalamus               2             2       111                   160              69.4
SCm           Superior colliculus motor related               11            6       738                   997              74.0
SCs           Superior colliculus sensory related             10            6       192                   317              60.6
SNr           Substantia nigra reticular part                 o             3       201                   274              73.4
SSp           Primary somatosensory area                      5             4       296                   461              64.2
SUB           Subiculum                                       9             7       494                   669              73.8
VISa          Anterior visual area                            5             4       207                   285              72.6
VISam         Anteromedial visual area                        11            8       501                   805              62.2
VISl          Lateral visual area                             3             2       248                   403              61.5
VISp          Primary visual area                             12            8       649                   923              70.3
VISpm         Posteromedial visual area                       4             4       230                   516              44.6
VISrl         Rostrolateral visual area                       2             2       85                    236              36.0
VPL           Ventral posterolateral nucleus of the thalamus  4             3       202                   255              79.2
VPM           Ventral posteromedial nucleus of the thalamus   4             2       235                   357              65.8
ZI            Zona incerta                                    4             2       167                   220              75.9
Data exclusions | Behavioral trials were excluded when there were three or more miss trials in a row indicating disengagement; this criterion was pre-established. 
Neurons were excluded when the null hypothesis that their firing rate was unmodulated during the task could not be rejected 
according to a series of four tests for different types of modulation; this criterion was not determined in advance but was developed from 
data inspection. 
Randomization | Trial types during the behavioral task were interleaved randomly. 
Embryonal tumours with multilayered rosettes (ETMRs) are aggressive paediatric 
embryonal brain tumours with a universally poor prognosis. Here we collected 193 
primary ETMRs and 23 matched relapse samples to investigate the genomic landscape 
of this distinct tumour type. We found that patients with tumours in which the 
proposed driver C19MC was not amplified frequently had germline mutations in 
DICER1 or other microRNA-related aberrations such as somatic amplification of miR- 
17-92 (also known as MIR17HG). Whole-genome sequencing revealed that tumours had 
an overall low recurrence of single-nucleotide variants (SNVs), but showed prevalent 
genomic instability caused by widespread occurrence of R-loop structures. We show 
that R-loop-associated chromosomal instability can be induced by the loss of DICER1 
function. Comparison of primary tumours and matched relapse samples showed a 
strong conservation of structural variants, but low conservation of SNVs. Moreover, 
many newly acquired SNVs are associated with a mutational signature related to 
cisplatin treatment. Finally, we show that targeting R-loops with topoisomerase and 
PARP inhibitors might be an effective treatment strategy for this deadly disease. 


ETMR is a malignant type of brain tumour that occurs almost exclusively 
in young children. The tumours show diverse histological 
patterns that have been described as ependymoblastoma (EBL), 
medulloepithelioma (MEPL) or embryonal tumour with abundant 
neuropil and true rosettes (ETANTR), which together form one distinct 
biological tumour type termed ETMR. Genetically, ETMRs are 
characterized by amplification and fusion of a microRNA (miRNA) 
cluster on chromosome 19 (C19MC) with TTYH1, which is present in 
approximately 90% of the tumours. As current treatment options 
fail for nearly all patients, a better understanding of ETMR tumorigenesis 
is needed to develop new treatment strategies. Here we characterized 
the molecular landscape of a large cohort of patients with 
primary and relapsed ETMRs to gain more insight into the inter- and 
intratumoral heterogeneity of ETMRs at diagnosis and relapse, and to 
identify tumour-driving mechanisms that may lead to more-effective 
treatment strategies. 


Intertumoral heterogeneity 


Cluster analysis of ETMR DNA methylation profiles (n = 193; Supple- 
mentary Table 1) or mRNA data (n = 28), confirmed that ETMRs are 
clearly distinct from other brain tumour types (Fig. 1a, b). ETMRs 
without C19MC amplification (n = 23) tend to cluster together at the 
edge of the main ETMR cluster but do not really separate, even when 
clustering only ETMRs (Extended Data Fig. 1a). Additionally, miRNAs 
are expressed in a distinct pattern in ETMRs, but are similar between 
ETMRs with (n = 7) and without (n = 3) C19MC amplification (Extended 
Data Fig. 2). C19MC miRNAs are also expressed in ETMRs without C19MC 
amplification, albeit at an approximately tenfold lower level, but not 
in normal brain or other brain tumours (Fig. 1c). Mature miRNAs that 
are specifically upregulated in ETMRs included all C19MC miRNAs and 
members of the miR-17-92 miRNA cluster, whereas several members of 
the let-7 family of miRNAs are specifically downregulated in ETMRs 
(Supplementary Table 2). 


A list of affiliations appears at the end of the paper. 
Fig. 1 | ETMRs regardless of C19MC amplification show high molecular 
similarity. a, b, t-Stochastic neighbourhood embedding clustering using 
either DNA methylation (a) or mRNA expression (b) data. Colours represent 
different tumour entities classified as described previously. Definitions of the 
groups are provided in Supplementary Table 10. c, Violin plot showing log2-


ETMRs occur throughout the brain, without any association with 
different histological variants, although tumours without C19MC amplification 
were significantly more often located in infratentorial regions 
(Fisher’s exact test, P= 8.80 x 10’). None of the other clinical annotations 
was associated with any molecular subgrouping (Extended Data 
Fig. 1a, b). Taken together, these data show that ETMRs are a molecularly 
distinct tumour type, with limited molecular intertumour heterogeneity 
despite diverse histopathological features, absence of C19MC amplification 
in a subset of tumours and a variety of anatomical locations. 


Intratumour heterogeneity 


By comparing ETMR mRNA data against healthy brain tissue and 
other brain tumours (Supplementary Table 3), we identified several 
upregulated pathways that are related to development, including RNA 
processing, and Hippo, WNT, Notch and SHH signalling (Extended 
Data Fig. 3). Additionally, compared to other tumours, most ETMRs 
showed upregulation of DNA repair pathways, including base excision 
repair, Fanconi anaemia and the p53 pathway (Extended Data Fig. 4a 
and Supplementary Table 4). However, a subset of tumours (10 out 
of 28) lacked this upregulation. As no distinct molecular subgroups 
were detected, we investigated whether this was due to differences 
in intratumour heterogeneity. CIBERSORT analysis was used to compare 
tumour expression data with single-cell RNA-sequencing data 
of the developing prefrontal cortex to delineate distinct tumour cell 
populations. This resulted in the identification of stem-cell-like, 
more-differentiated oligodendrocyte precursor-like and astrocyte-like 
tumour cells (Extended Data Fig. 4b). A higher proportion of differen- 
tiated progeny corresponded to lower levels of the stem cell markers 
LIN28A and HMGA2, lower levels of the DNA-damage checkpoint genes 
WEE1 and CHEK2, but higher levels of the astrocyte markers AQP4 and 
GFAP (Extended Data Fig. 4c). 

Notably, these observations correlated with the histology of the 
tumours. Tumours with more-differentiated cells and lower expression 
of DNA repair genes tended to be more often diagnosed as ETANTRs, 
while tumours with less-differentiated cells and higher expression 
of DNA repair genes were more likely to be described as EBL or MEPL 
(Extended Data Fig. 4d). These findings were validated by methylation 
profiling and RNA sequencing of microdissected neuropil (more dif- 
ferentiated) and rosettes (less differentiated) of an ETMR diagnosed 
as ETANTR. Although RNA sequencing confirmed that the expres- 
sion of stem cell and astrocyte markers differed between the two cell 
populations, methylation profiles were highly similar and clustered 
together with other ETMRs (Extended Data Fig. 4e-g and Supplemen- 
tary Table 3). 

Previously, we reported that LIN28A is more widely expressed upon 
relapse and histology shifts towards a more EBL or MEPL phenotype. 


transformed expression of C19MC miRNAs (n = 56) in ETMRs and other tissues. 
P values were calculated using two-sided Mann-Whitney U-tests; ***P < 0.0005. 
Box plots show the median + interquartile range, whiskers extend to 1.5× the 
interquartile range, violin plots depict kernel density estimates and represent 
the density distribution. 


To investigate whether this is due to the outgrowth of the stem-cell-like 
population after treatment, we compared the expression of LIN28A, 
HMGA2, WEE1, CHEK2, AQP4 and GFAP between a primary ETANTR and 
two matched recurrences and found that in both relapse samples the 
gene expression is shifting to be more stem-cell-like (Extended Data 
Fig. 4h). Differences in expression of DNA repair genes and histology are 
therefore likely to be explained by the levels of differentiated progeny, 
while undifferentiated cells with high levels of DNA repair expression 
grow out upon relapse. 


Recurrent aberrations 


We sequenced 82 tumours, including 16 cases without C19MC amplification 
and 12 recurrences (Methods and Supplementary Tables 5, 6), to see 
whether there were any recurrent DNA aberrations other than C19MC 
amplification. Other recurrently mutated genes that were detected in 
the primary tumours included DICER1 (8 out of 70), CTNNB1 (7 out of 
70) and TP53 (5 out of 70), whereas mutations (other than CTNNB1) in 
developmental pathways, which have previously been suggested to 
have a role in ETMR tumorigenesis, were detected only sporadically 
(Fig. 2a). 

Notably, all DICER1 mutations occurred in cases that lacked the 
C19MC amplification. All eight samples had compound heterozygous 
mutations, including one somatic mutation (E1705K (three samples), 
D1709N (two samples), G1809R, E1813G and E1813K) in the hotspot 
region that affects the RNASE IIIb domain, which is important for miRNA 
processing. The second mutation was found in the germline (validated 
for seven out of eight samples) and was in all cases deleterious (Fig. 2b). 
Sequencing of mature miRNAs indeed showed a strongly increased 
ratio of 3’ over 5’ mature miRNAs in one of the DICER1-mutated samples 
(ET31) compared to individuals that have no DICER1 mutations (Fig. 2c). 

Two ETMRs (ET68 and ET173, which lacked C19MC amplification and 
DICER1 mutations) had amplifications of the miR-17-92 miRNA cluster 
on chromosome 13. In ET68 this cluster was fused to a region on chro- 
mosome 11 to form a circular chromosome, which was confirmed by 
mate pair sequencing, copy-number profiles and fluorescence in situ 
hybridization (Extended Data Fig. 5a, b). Genes on the circular chromosome, 
including the miR-17-92 miRNA cluster, showed an approximately 
fourfold higher expression level compared to other ETMRs (Extended 
Data Fig. 5c). The amplification of the miR-17-92 miRNA cluster was also 
found once in an ETMR (ET85) with C19MC amplification. 

Three other samples (ET89, ET160 and ET168, which lacked C19MC 
amplification and DICER1 mutations) showed clustered breakpoints 
that affected the C19MC locus, suggesting that even though C19MC was 
not amplified, its expression might be driven by structural rearrange- 
ments (Extended Data Fig. 5d). In ET160, we found a fusion between 
C19MC and MYO9B, which is also located on chromosome 19. Alternative 


Nature | Vol 576 | 12 December 2019 | 275 


[Fig. 2 image. a, Oncoplot with samples grouped as C19MC not amplified and C19MC amplified; rows grouped as miRNA related, Genome maintenance, SHH/WNT pathways, Growth/proliferation, Other and Available data; legend: Somatic amplification, Somatic SV, Somatic SNV, Somatic LOF, Germline LOF, LOH, LOF, Genomic instability, Data available, Not present, Not available. b, DICER1 domains (RNA binding, RNase III, PAZ, P-loop, Helicase, Dimerization) with mutations from ET31, ET90, ET152, ET161, ET175, ET189, ET190 and ET191, marked as Germline, Somatic or Germline not available. c, Groups: Brain, ETMR, GBM, MB, PA.] 


Fig. 2 | ETMRs without C19MC amplification recurrently have miRNA-related 
aberrations. a, Oncoplot showing somatic events occurring in ETMRs. LOF, 
loss of function; WES, whole-exome sequencing; WGS, whole-genome 
sequencing. b, Overview of identified DICER1 mutations. Alternating blue 
colours represent exons, yellow bars represent domains and pins represent the 
different aberrations found. c, Quantification of miRNA processing using the 
median ratio between 3p and 5p miRNAs (n = 375); each bar represents one 
tumour; the blue bar denotes an ETMR with a DICER1 mutation. P values were 
calculated using a one-sided Mann-Whitney U-test; ***P < 0.0005. 
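
The per-tumour summary in c, a median of 3p-to-5p ratios across miRNAs, can be sketched in a few lines of Python. The input layout, the miRNA names, the read counts and the choice to skip miRNAs with a zero 5p count are all assumptions made for illustration; the caption does not describe how such cases were handled.

```python
from statistics import median


def median_3p_5p_ratio(counts):
    """counts maps a miRNA name to (reads_3p, reads_5p). Pairs with a zero
    5p count are skipped here to avoid division by zero (an assumption)."""
    ratios = [c3 / c5 for c3, c5 in counts.values() if c5 > 0]
    return median(ratios)


# Hypothetical read counts for four miRNA hairpins in one tumour.
example = {
    "mir-a": (200, 100),
    "mir-b": (50, 100),
    "mir-c": (400, 100),
    "mir-d": (10, 0),
}
ratio = median_3p_5p_ratio(example)
print(ratio)
```

Using the median rather than the mean keeps a single highly expressed miRNA from dominating the per-tumour value.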


fusion partners to TTYH1 were not unique for cases without the C19MC 
amplification, as mate pair sequencing also identified an alternative 
fusion partner, MIRLET7BHG, in ET20 with C19MC amplification. 

Finally, we identified several patients throughout the cohort with 
severe chromosomal instability, including four individuals without 
C19MC amplification (ET57, ET72, ET74 and ET104; Extended Data 
Fig. 5e). Taken together, our data show that ETMRs have very few 
recurrently mutated genes, and are largely characterized by mutu- 
ally exclusive aberrations that affect the miRNA pathway, including 
amplifications of the C19MC and miR-17-92 clusters and mutations in 
the miRNA-processing gene DICER1. 

In contrast to SNVs and small insertions or deletions, many copy- 
number aberrations (CNAs) were recurrent throughout the cohort 
based on methylation array-derived copy-number profiling (n = 193). 
As shown previously, 90% of all ETMRs had C19MC amplification and 
chromosome 2 gain was detected in 76% of the cases. Other recur- 
rent CNAs included chromosome 6q loss (25%), and gains of chromo- 
somes 1q (26%), 17q (11%), 7 (12%), 3q (11%) and 11q (11%). However, no 
significant differences were found between ETMRs with or without 
C19MC amplification (Extended Data Fig. 6a—c). We also compared 
copy-number profiles of 18 matched primary-relapse pairs, identifying 
frequently acquired CNAs, including 6q loss (22%), and gains of 
chromosomes 1q (33%), 17q (33%) and 7 (17%) (Extended Data Fig. 6d 
and Supplementary Table 7). In addition, polyploidy was detected in 
18% of the primary tumours, and acquired in 28% of the patients upon 
relapse (Extended Data Fig. 6d, e). 

Furthermore, ETMRs sequenced by whole-genome sequencing 
were investigated to detect translocations, inversions, deletions and 
insertions (Supplementary Table 8). Many clustered breakpoints 
that resemble chromothripsis were recurrently found surrounding 
C19MC, although other chromosomes were also sporadically affected 
Fig. 3 | Comparison between primary and relapse tumours reveals poor
conservation of SNVs but high conservation of SVs. a, Allele frequencies of
combined SNVs found between four primary tumours and the matched initial
relapses. Boxes represent SNVs that are gained, conserved and lost upon
relapse. b, Analysis of mutational signatures of primary and relapsed tumours
based on previously defined mutational signatures. c, Radial plot depicting
log2-transformed fold changes of the somatic SNV burden between primary
tumour (PRIM) and subsequent relapses (REC1 and REC2) coloured by
exposures shown in b. Number of mutations in the primary tumour was set to 1
for visualization purposes. d, Venn diagram showing the overlap of
breakpoints between a primary tumour and matched relapses. e, Circular
representation of the genome of SVs and CNAs in a primary tumour with the
two matched recurrences shown in d. The circles represent, from outer to
inner, the CNAs found in the primary tumour, the first recurrence and second
recurrence. The middle part represents the different SVs found between
primary tumours and recurrences. Chromosomes 19 and X have been enlarged.


(Extended Data Fig. 6f). These breakpoints were also observed in copy- 
number profiles: 16% of the cohort had visible alternating copy-number 
states around C19MC. Together, the high number of CNAs, recurrent 
polyploidy and recurrent complex rearrangements suggest that ETMR 
genomes are structurally unstable.


Primary tumour-relapse comparisons 


To investigate which events are retained in relapse samples and there- 
fore potentially tumour driving, we used whole-genome sequencing 
data of nine matched recurrences and analysed allele frequencies of 
all detected SNVs in primary tumour-relapse pairs (Supplementary 
Table 9). Overall, we found that only 51% (2,138 out of 4,226) of all SNVs 
that are present at allele frequencies of at least 20% in the four matched 
primary tumours are also detectable at allele frequencies of 2% or more 
in the respective matching relapse sample (Fig. 3a). In addition, two 
out of four tumours did not have a single coding non-synonymous 
SNV detected in both primary tumour and first relapse, suggesting 
that acquiring somatic SNVs is not an early (driving) event (Extended 
Data Fig. 7). 
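
The conservation percentage quoted above follows from the two allele-frequency thresholds. A minimal Python sketch, assuming each SNV is represented by a hypothetical (primary allele frequency, relapse allele frequency) pair; the function name and the toy data are illustrative only and are not part of the study's pipeline:

```python
def conserved_fraction(snvs, primary_min_af=0.20, relapse_min_af=0.02):
    """Fraction of SNVs present in the primary tumour (AF >= primary_min_af)
    that are still detectable in the relapse (AF >= relapse_min_af)."""
    in_primary = [(p, r) for p, r in snvs if p >= primary_min_af]
    if not in_primary:
        return 0.0
    conserved = sum(1 for _, r in in_primary if r >= relapse_min_af)
    return conserved / len(in_primary)


# Toy data (hypothetical): (AF in primary tumour, AF in matched relapse)
toy_snvs = [(0.45, 0.40), (0.30, 0.00), (0.25, 0.03), (0.10, 0.50)]
toy_fraction = conserved_fraction(toy_snvs)

# The counts reported in the text reproduce the quoted percentage.
reported_fraction = 2138 / 4226
```

In the toy data, the last SNV is ignored because it falls below the 20% threshold in the primary tumour, and two of the remaining three are counted as conserved.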

Relapsed tumours have a large increase in somatic SNVs compared 
to primary tumours, suggesting that new mutations may have been 
induced by treatment (Extended Data Fig. 8a). To investigate this, 
we compared mutational signatures between primary tumours and 
recurrences and observed a shift from mainly signatures 1 and 16 (both
associated with ageing) in primary tumours to signature P1 in
recurrences, a novel signature found in our paediatric pancancer
cohort that has not yet been associated with a mutagenesis process
(Fig. 3b). To find the potential underlying mechanism, we tested
for mutational strand bias in somatic SNVs. No bias was detected in 
Fig. 4 | Breakpoint context reveals a possible role for R-loops in initiating
ETMRs. a, Schematic representation of Gene Ontology (GO) term enrichment.
Circles represent GO terms, sizes represent enrichment and colours represent
groups based on similarity scores (co-occurrence association score >0.05)
(Supplementary Table 3). b, Enrichment in the DRIP signal around C19MC (fold
enrichment over input). c, Density of ETMR breakpoints (n = 2,301) overlapping
DRIP peaks (left; n = 16,002) and RLFS (right; n = 85,534) compared to random
regions of the same size. P values were calculated using a two-sided χ² test;
***P < 0.0005. d, Enrichment of breakpoints overlapping RLFS compared to
10,000 randomly generated sets of regions of the same size for ETMRs and
other entities. Definitions of the groups are provided in Supplementary
Table 10. Boxes show the range (median, first and third quartile) of
Benjamini-Hochberg-adjusted P values calculated using one-sided Fisher’s
exact tests; whiskers extend to upper and lower limits of the data (90% and
10%, respectively).


primary tumours; however, recurrences showed an increased bias of
mainly C>A, C>T and T>A mutations towards the transcribed strand
(Extended Data Fig. 8b), which has previously also been observed in
tumours treated with platinum-based agents and cyclophosphamide.
Comparing matched primary tumours with multiple recurrences con- 
firmed that the composition of SNVs changed after treatment, but 
remained similar in subsequent relapses (Fig. 3c). Furthermore, the 
trinucleotide counts changed towards a signature that is highly similar 
to signature P1 and a cisplatin-induced mutational signature derived 
from cisplatin-treated cell lines (Extended Data Fig. 8c, d), suggesting
that the majority of acquired mutations in relapsed ETMRs is induced 
by cisplatin treatment. 

Despite the poor conservation of SNVs, there was a strong conserva- 
tion of breakpoints between primary tumours and recurrences. On aver- 
age 73% (specifically, 96%, 54%, 63%, 80% for the four pairs included in 
Fig. 3a) of all breakpoints were conserved between primary tumours 
and relapses, including all breakpoints that surrounded and formed 
C19MC aberrations (Extended Data Fig. 7). In the same sample analysed
for shifts in mutational signatures, only one out of 110 breakpoints in the
primary tumour was not detected in any recurrence (Fig. 3d, e). These 
data suggest that the formation of structural variants (SVs) is an early 
event in ETMR tumorigenesis. Further supporting this observation 
is an increased density of conserved mutations in close proximity to 
breakpoints, which were enriched for C>T and C>G mutations, which 
have previously been described in association with chromothripsis 
and replication stress (Extended Data Fig. 9). The process of forming
breakpoints continues on relapse and relapsed tumours also gain many
new SVs, which may have a role in tumour progression (Extended Data
Fig. 7). Therefore, even though we cannot fully exclude that the acquisi- 
tion of sporadic SNVs, also in non-coding regions of the genome, might 
e, Immunohistochemistry pictures of full slides (left) and magnifications
(right) for ETMRs (n = 5) and a representative case of WNT (n = 5) and Group 4
MB (n = 5). Scale bars, 1 mm (top left), 0.5 mm (bottom left) and 50 μm (top and
bottom right). f, Immunohistochemistry of wild-type (WT) Dicer1 and Dicer1
knockout (KO) mouse cell lines stained with DAPI (blue), the S9.6 antibody (red)
and an anti-γ-H2AX antibody (green). Scale bar, 10 μm. g, Quantification of
signals shown in f, presented as mean signal per cell (n = 255 wild-type mouse
cells, n = 227 Dicer1 knockout mouse cells), normalized to the DAPI signal.
P values were calculated using a two-sided Mann-Whitney U-test; ***P < 0.0005.
Boxes show median, first and third quartile, and whiskers extend to 1.5× the
interquartile range. h, Dot blot of DNA-RNA hybrids extracted from wild-type
and Dicer1 knockout mouse cells; single-stranded (ss)DNA was used as loading
control.


play a part in tumour formation, the high conservation of breakpoints
and recurrent chromosomal instability are more likely to be driving 
ETMRs than the poorly conserved SNVs. 


R-loop-associated chromosomal instability 


In recent years, multiple papers have described an association between
the formation of R-loops (structures that form upon stalling of RNA
polymerase, which results in a displaced non-template single-stranded
DNA loop) and DNA damage. R-loops, which can form following disrupted
helicase activity, can either facilitate or result from collision
of transcription and replication and can potentially lead to
chromosomal instability. Notably, expression of DNA/RNA helicases and
processes associated with helicase activity were highly upregulated 
in ETMRs, suggesting a possible role for R-loops in the formation of 
breakpoints (Fig. 4a). 

To investigate this, we used the prediction of R-loop forming
sequences (RLFS) and performed DNA-RNA hybrid immunoprecipitation
(DRIP) sequencing (DRIP-seq) on the C19MC-amplified BT183 ETMR
cell line. To assess how well the datasets matched, we compared the
RLFS and DRIP-seq data to published DRIP-seq data for Ewing sarcoma
and observed a similar genome-wide pattern (Extended Data Fig. 10a,
b). R-loops in ETMRs were mostly observed in regions that surrounded
the transcription start site, CpG islands and G-quadruplex-forming
repeats and were also strongly enriched in peaks observed in Ewing
sarcoma and RLFS (Extended Data Fig. 10c). Overall, we observed a high
density of R-loops on chromosome 19, but only in ETMRs did we observe
a large peak surrounding C19MC (Fig. 4b). Genome wide, many breakpoints
overlapped with R-loops, and the formation of breakpoints had
a stronger association with regions that are enriched in R-loops, which
Fig. 5 | ETMR cells are sensitive to combination therapy with PARP and TOP1
inhibitors. a, Dose-response curves of ETMR cells treated with pamiparib or
veliparib, topotecan and the combinations of drugs. P values were calculated
using two-way analysis of variance (ANOVA). Data are mean ± s.e.m.
b, Calculation of synergy using the Chou-Talalay method of drug treatments
shown in a. c, Immunohistochemistry of ETMR cells stained with an
anti-γ-H2AX antibody and the S9.6 antibody. Cells were treated using IC50
concentrations of each drug or combination of drugs. Scale bar, 10 μm.
d, Quantification of the signal shown in c, presented as mean signal per cell
(n = 59 DMSO-treated cells, n = 158 pamiparib-treated cells, n = 88
topotecan-treated cells, n = 20 cells treated with the indicated drug
combination), normalized to the total DAPI signal in the cell. P values were
calculated using two-sided Mann-Whitney U-tests; ***P < 0.0005.


was similar for breakpoints that occurred in Ewing sarcoma but not for 
breakpoints that occurred in other tumour entities (Extended Data
Fig. 10d). Therefore, we compared the relative number of breakpoints
in DRIP-seq peaks and RLFS to that in regions outside R-loops and
found a strong enrichment of all SV types, including those that do not
form the C19MC aberrations, in both DRIP-seq peaks and RLFS (Fig. 4c 
and Extended Data Fig. 10b). Compared to multiple other tumour enti- 
ties, the enrichment of breakpoints that fell in R-loops or RLFS was 
specifically present in ETMRs (Fig. 4d). Furthermore, we observed an 
increased density of breakpoints in close proximity to R-loops and 
RLFS compared to other tumours that did not occur for other genomic 
elements (Extended Data Fig. 10e, f); these breakpoints most likely 
resulted from damage induced by stalled replication forks. Finally, we
also observed an increased density of SNVs that fell in R-loops and RLFS,
in line with the observed increased density of SNVs around breakpoints
(Extended Data Fig. 10g). Together, these data suggest that R-loops 
may indeed play a part in generating breakpoints in ETMR, including 
the breakpoints observed around C19MC. 

To validate the high levels of R-loops in ETMRs, we stained five 
ETMRs for R-loops using the R-loop-specific antibody S9.6 and compared
the pattern of R-loop staining to staining in medulloblastomas
(n = 10) (Fig. 4e). ETMRs were highly positive for R-loops in the rosette
structures whereas medulloblastomas were negative throughout the 
entire sections. Most ETMRs with or without C19MC amplification have 
miRNA-related aberrations, and multiple miRNA-processing factors 
have been associated with R-loop formation, including DICER1,
which has been shown to be directly involved in DNA repair and has
helicase activity. To test this, we carried out dot blot and immunostaining
experiments for R-loops and double-stranded breaks in a Dicer1
knockout mouse stem cell line and its isogenic control. Indeed, both the
levels of R-loops and double-stranded breaks (as indicated by increased
γ-H2AX staining) were increased in the Dicer1 knockout cells (Fig. 4f-h).
In addition, sequencing detected chromothripsis events and many 
breakpoints in RLFS that resemble the phenotype in ETMRs around 
C19MC (Extended Data Fig. 10h, i). These data suggest that R-loops may 
cause chromosomal instability in ETMRs and defective miRNA process- 
ing might have a role in generating the increased levels of R-loops. 


Efficacy of PARP and TOP1 inhibition 


Finally, we investigated whether R-loops could potentially be exploited 
for therapy. Previously, we reported that ETMRs are highly sensitive 
to inhibitors of topoisomerase I (TOP1), an enzyme that can resolve 
R-loops among other functions. Other studies have shown that
topotecan and irinotecan act as TOP1 poisons by covalently binding
TOP1 to the DNA, which can be further enhanced by trapping PARP1,
as PARP1 is able to release TOP1 by PARylation. Therefore, we tested
whether a combination treatment with PARP and TOP1 inhibitors would
lead to a further increase in R-loops and increased response to therapy.

First, we tested whether there is synergy between PARP and TOP1
inhibition in ETMR cells. Topotecan alone is effective with half-maximum
inhibitory concentration (IC50) values of around 5 nM, whereas PARP
inhibitors are less effective with IC50 values of approximately 10 μM for
pamiparib (BGB-290) and about 15 μM for veliparib. Both PARP inhibitors,
however, act highly synergistically when used in combination with
topotecan and lead to a larger decrease in viability than monotherapy
with any of the drugs (Fig. 5a). Synergy and cell death were only observed
once concentrations of topotecan were higher than 1.25 nM, suggesting
that the synergy is mostly dependent on sufficient levels of TOP1
inhibition (Fig. 5b).

Next, we investigated whether topotecan alone or combined with 
PARP inhibitors is correlated with R-loop levels. Topotecan treatment 
(6 h) increased the number of nuclear R-loops whereas pamiparib had
almost no effect on R-loops (Fig. 5c). Combination therapy induced
R-loops at an even higher rate and caused more DNA damage, as shown
by γ-H2AX staining, than monotherapy. Quantification of the nuclear
signals shows that γ-H2AX and R-loop signals highly correlate,
suggesting that an increase in R-loops increases the amount of DNA
damage (Fig. 5d). Together, these data show that PARP and topoisomerase
inhibitors act synergistically in ETMRs and combination treatment 
could, after thorough in vivo testing, potentially be used as treatment 
for patients with ETMR. 


Discussion 


Here, we present the molecular landscape of ETMRs at diagnosis and 
relapse, resulting in a set of hallmarks that describe this tumour type 
(Fig. 6). We show that all ETMRs are molecularly similar, despite differ- 
ences in histology, the presence or absence of C19MC amplification 
and their location in the brain. C19MC amplification is still considered
the main driver based on its strong conservation and high recurrence.
Moreover, we identified DICER1 as a predisposition gene for ETMR.
Our data provide further insights into the drivers of these tumours
and how the tumours change after treatment, largely because of
treatment with cisplatin.

miRNA expression profiles correlate strongly between ETMRs even 
though the underlying mechanism that deregulates miRNAs may be 
different. A possible explanation for this might be that tumours with 
C19MC or miR-17-92 amplification oversaturate the miRNA-process- 
ing machinery. This could potentially mimic deregulated process- 
ing by DICER1 and also explains why RNA transport—vital for miRNA 
Fig. 6 | Hallmarks of ETMR. Schematic summary of ETMR characteristics.


processing and shown to be affected by miRNA saturation—is highly 
upregulated in ETMRs (Extended Data Fig. 3). Oversaturation of
the miRNA machinery could also explain why all ETMRs, regardless
of amplification status, show many structural aberrations, as several
members of the miRNA processing machinery, including DICER1 and
DROSHA, have also been associated with R-loop formation.

Other mechanisms that cause chromosomal instability may also 
play a part. Breakpoints were enriched in RLFS, which are an intrinsic 
property of the genome, suggesting that R-loops predispose certain 
loci to acquiring DNA damage. Especially because R-loops were found
in close proximity of C19MC, there could be an event before C19MC
amplification. Considering the early onset of the disease and strong
conservation of SVs, it is possible that there is a genetic predisposition
that increases the levels of R-loops, which in turn causes breakage at
sites that form R-loops, leading to C19MC amplification in ETMRs. Also,
C19MC amplification itself might further increase the levels of R-loops 
causing the number of breakpoints to increase. 

One such genetic predisposition is identified here through recurrent
DICER1 germline mutations. With this finding, ETMR is added to the
large spectrum of tumours that may arise in the context of DICER1
syndrome. The link between chromosomal instability, R-loops and DICER1
mutations in ETMRs may occur in other cancers as well, providing a
rationale for the cancer susceptibility seen in those patients. Indeed,
tumours associated with DICER1 syndrome seem to frequently acquire
CNAs, although whether this is related to R-loop formation remains
to be elucidated. We recommend that patients with ETMR and their
families should consider genetic counselling for DICER1 syndrome,
at least in cases in which C19MC is not amplified. 

Presence of R-loops can also have therapeutic implications. A recent 
study in patients with Ewing sarcoma has shown that R-loops may be 
associated with hyper-transcription, which leads to the sequestration 
of BRCA1 and prevents BRCA1 from promoting double-stranded break
repair. This explains why targeting these tumours with PARP inhibitors
is successful. Here, we show that TOP1 inhibition can increase the
levels of R-loops that cause an increased amount of damage to cells that 
already have high levels of R-loops. This mechanism may also be appli- 
cable to other drugs, providing potential novel treatment strategies. 

In summary, by characterizing the events that drive ETMRs, we 
provide a valuable resource for future ETMR research and potential 
novel therapies that could be further pursued to treat patients with 
this deadly tumour type. 
Methods


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 


Experimental model and patient details 

All patient samples included in this study were acquired under informed 
consent according to the ICGC (www.icgc.org) guidelines or INFORM 
(www.dkfz.de/en/inform/) guidelines (for cases ET85 and ET86) and 
all included relapsed tumours from those patients. For all included 
cases, consent was approved by the review board of the contributing 
centres before shipment. Primary tumours were derived from patients 
who did not undergo radiotherapy or chemotherapy before surgi- 
cal removal of the tumour. Material from relapses was derived from 
patients that received therapy as described in Supplementary Table 1
(radiotherapy or chemotherapy, which, in all cases, included cyclo- 
phosphamide and either cisplatin or carboplatin). 

Samples were included based on different ETMR histologies as
described previously and 450k/EPIC classification, and amplification
of C19MC was confirmed by visual inspection of copy number profiles.
As many tumours with central nervous system primitive neuroectodermal
tumour (CNS-PNET) histology could be reclassified as described
previously, tumours with CNS-PNET and pineoblastoma histologies
were also included if there was sufficient evidence based on 450k/EPIC
clustering. The presence or absence of C19MC aberrations, based on 
copy numbers, was validated in approximately half of the cohort of 
primary tumours (90 tumours) using fluorescence in situ hybridiza- 
tion (FISH) and/or whole-genome sequencing (WGS) and mate pair 
sequencing when available. 

All experiments involving cell culture of ETMR cells were performed 
using the BT183 cell line that was obtained from J. A. Chan, and the
characteristics of this cell line have been described in the original
publication. Cells were cultured in low-adhesion cell-culture flasks as
spheroid cultures as described previously. The cell line was derived
from a primary tumour of a two-year-old patient and was acquired
after informed consent. Dicer1 wild-type and Dicer1 knockout mouse stem cells were
obtained from ATCC (CRL-3220, CRL-3221) and cultured as specified 
by the manufacturer. All cell lines have been tested for mycoplasma 
contamination and were found to be negative. 


Sample and library preparation for DNA sequencing 

Out of the 70 primary tumours, we performed WGS for 20 cases, all with 
matched germline samples, and whole-exome sequencing for 17 cases 
without germline samples to find (recurrent) genetic events that could 
potentially deregulate important pathways. In addition, we performed 
targeted sequencing for another 33 primary ETMRs and one matched 
recurrence, using a gene panel of 158 genes that were found to be
recurrently affected in other brain tumour types (Supplementary Table 4).

DNA was isolated from fresh-frozen tumour material using either the 
Qiagen DNeasy Blood and Tissue kit or the Promega Maxwell RSC Tissue 
DNA Kit (AS1610). DNA from the matching blood samples was extracted 
using the Qiagen Blood and Cell Culture Midi Kit or Promega Maxwell 
RSC Blood DNA Kit (AS14000) according to the provided protocol. For 
samples sequenced using targeted sequencing and methylation profil- 
ing of samples for which only formalin-fixed paraffin-embedded (FFPE) 
material was available, DNA was extracted using the Maxwell RSC FFPE 
kit (AS1450) following the manufacturer’s instructions. 

WGS libraries were prepared using the Illumina TruSeq Nano DNA 
LT Library Prep or TruSeq Nano DNA HT Library Prep Kit following 
the manufacturer’s instructions. In brief, 100 ng of genomic DNA was 
fragmented to approximately 350 bp using a Covaris ultrasonicator 
(Covaris). The fragmented DNA was then end-repaired, size-selected 
using magnetic beads, extended with an ‘A’ base on the 3′ end and ligated


with TruSeq paired-end indexing adapters. The adaptor-ligated frag- 
ment libraries were enriched using eight cycles of PCR and purified 1-2 
times using magnetic beads. All generated libraries were validated using 
the Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific) and using 
Agilent 2200 or 4200 TapeStation (Agilent Technologies). Libraries 
were sequenced on the Illumina HiSeq X (2 x 151-bp paired-end reads) 
according to the manufacturer’s protocol. 

For samples ET1, ET12, ET16, ET19, ET20, ET30, ET31, ET66, ET68, 
ET71, ET72 and ET87 next-generation sequencing (NGS) libraries were 
created as described previously using Illumina v.2 protocols. Libraries
for those samples were subsequently sequenced using HiSeq 2000
v.4 instruments (paired-end 125-bp reads) using three lanes on the
machine and achieving a comparable coverage to samples sequenced
on the HiSeq X. The average sequencing coverage was 33.95× for tumour
material and 34.99× for blood.

All samples sequenced for WGS were submitted to the DKFZ Genomics
and Proteomics Core Facility and were only included for library
preparation after passing all standard quality controls. Only samples 
with a DNA integrity over 7 were included in the study. 

Sequencing of samples using whole-exome sequencing was performed
by creating libraries using the Illumina TruSeq exome enrichment
kit following the manufacturer’s instructions after size selection.
Size selection was performed by fractionation using a Covaris
ultrasonicator and subsequent selection was performed using a 1.5% gel
Pippin Prep cassette (Sage Science). Sequencing was performed at
the Genome Quebec Innovation Centre using Illumina HiSeq 2000
instruments or at the ICGex NGS platform of the Institut Curie using
HiSeq 2500 instruments for the two DICER1-mutated samples that
were reported previously.

Targeted sequencing was performed by creating libraries using 
the Agilent SureSelect XT technology. Libraries were sequenced 
using molecular barcode-indexed ligation-based sequencing using
a NextSeq500 (Illumina) instrument. The genes listed in
Supplementary Table 4 were sequenced at an average coverage of
roughly 100×.


DNA methylation array 

DNA methylation profiling was performed as described previously.
DNA was extracted in the same manner as described for WGS using 
500 ng as input material for fresh-frozen tissue and 250 ng input 
material for FFPE tissue. Array data were created using the Infinium 
HumanMethylation450 BeadChip array (450k array) according to 
the manufacturer’s instructions (Illumina) at the DKFZ Genomics and 
Proteomics Core Facility. For a subset of samples (described in
Supplementary Table 1) Methylation BeadChip (EPIC) arrays were used.
CpG probes that were used for the analysis were filtered based on the
presence of a common SNP within five bases of the probe, reads not
mapping uniquely to the reference genome, probes mapping to the X 
and Y chromosome and reads not overlapping between 450K arrays 
and EPIC arrays. 

Clustering was performed after correction of samples for the origin
of the DNA (FFPE or fresh frozen) using surrogate variable analysis
(sva) and only the 10,000 most variable probes based on the full
dataset after correction were selected for clustering. Distance was
calculated using 1 − Pearson correlation and linkage was calculated
using the average as measure. Subsequently, t-distributed stochastic
neighbour embedding (t-SNE) analysis using Rtsne (v.0.13) was applied to
generate the figures.
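
The probe selection and distance measure described above can be illustrated with a small, dependency-free Python sketch. It assumes a hypothetical probe-by-sample matrix of beta values and does not reproduce the sva correction, the average-linkage clustering or the t-SNE step; all function names and the toy values are illustrative.

```python
from statistics import mean, pvariance


def most_variable_probes(matrix, k):
    """Indices of the k probes (rows) with the highest variance across samples."""
    ranked = sorted(range(len(matrix)),
                    key=lambda i: pvariance(matrix[i]), reverse=True)
    return sorted(ranked[:k])


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def sample_distances(matrix, probe_idx):
    """Pairwise 1 - Pearson correlation between samples (columns)."""
    n_samples = len(matrix[0])
    cols = [[matrix[i][s] for i in probe_idx] for s in range(n_samples)]
    return [[1.0 - pearson(cols[a], cols[b]) for b in range(n_samples)]
            for a in range(n_samples)]


# Toy beta values (hypothetical): 4 probes x 3 samples
betas = [
    [0.10, 0.12, 0.90],
    [0.80, 0.82, 0.15],
    [0.50, 0.50, 0.51],  # nearly constant, so dropped when k = 3
    [0.20, 0.25, 0.70],
]
top = most_variable_probes(betas, k=3)
dist = sample_distances(betas, top)
```

With this measure, samples with near-identical methylation profiles have a distance close to 0, whereas anti-correlated profiles approach 2.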


Copy-number profiling and analysis 

Copy-number profiles were created using the conumee package 
(v.1.3.0). For the analysis of copy numbers, the average change in copy 
number observed for chromosome 2 gains in annotated samples from 
a previous study was used as a cut-off to determine gains and losses.
Copy numbers were subsequently manually assessed to prevent false
positives and false negatives due to differences in tumour cell content
or ploidy. Calling of focal amplifications or deletions was performed
similarly using three times the average change in copy number for
chromosome 2 gains followed by manual assessment. For samples
with increased ploidy, CNAs that were likely to be due to ploidy
changes were filtered out manually to reduce the number of false positives.
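
The thresholding logic described above can be sketched as follows. This is only an illustration under stated assumptions: the reference shift for chromosome 2 gains (0.15 here) is a made-up value, the use of inclusive comparisons is an assumption, and the manual review steps are not modelled.

```python
def call_arm_level(log_ratio, chr2_gain_shift):
    """Classify a segment as gain, loss or neutral using the average
    chromosome 2 gain shift as a symmetric cut-off (assumed inclusive)."""
    if log_ratio >= chr2_gain_shift:
        return "gain"
    if log_ratio <= -chr2_gain_shift:
        return "loss"
    return "neutral"


def call_focal(log_ratio, chr2_gain_shift, factor=3):
    """Classify a focal event using factor times the chromosome 2 gain shift."""
    cutoff = factor * chr2_gain_shift
    if log_ratio >= cutoff:
        return "amplification"
    if log_ratio <= -cutoff:
        return "deletion"
    return "none"


# Hypothetical reference shift derived from annotated chromosome 2 gains
REFERENCE_SHIFT = 0.15
arm_calls = [call_arm_level(x, REFERENCE_SHIFT) for x in (0.20, -0.20, 0.05)]
focal_calls = [call_focal(x, REFERENCE_SHIFT) for x in (0.50, 0.30, -0.60)]
```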


Processing and alignment of DNA sequencing 

Sequenced reads using WGS or whole-exome sequencing approaches 
were aligned to the human genome version 19 (hg19) reference genome 
using BWA-mem v.0.7.8-r455 using default settings and -T 0. Lanes were
sorted using biobambam bamsort (v.0.0.148) and duplicate reads were
removed using biobambam bammarkduplicates. For data generated
using the HiSeq X, Picard (v.1.125)
(https://broadinstitute.github.io/picard/) was used to filter out
duplicate mapping reads. Reads were
removed if the phred-scaled base quality over the length of the read was
below 25. For targeted sequencing, reads were aligned to the human
reference genome version 19 (hg19) using BWA-mem v.0.7.12-r1039.
Duplicated reads were removed using Picard v.1.113 and reads were
sorted using SAMtools v.0.1.17-r973. Reads were removed if the
phred-scaled base quality over the length of the read was below 25.
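
As an illustration of the read-level quality filter, the sketch below assumes that "base quality over the length of the read" means the mean Phred score of the read and that qualities are Phred+33 encoded, as in standard FASTQ files. Both are assumptions; the text does not specify them, and the function names are hypothetical.

```python
def mean_base_quality(quality_string, offset=33):
    """Mean Phred score of a read from its FASTQ quality string
    (Phred+33 encoding assumed)."""
    if not quality_string:
        return 0.0
    return sum(ord(c) - offset for c in quality_string) / len(quality_string)


def keep_read(quality_string, min_mean_quality=25):
    """Keep a read only if its mean base quality is at least the threshold."""
    return mean_base_quality(quality_string) >= min_mean_quality


high_quality = "IIIIIIII"   # 'I' encodes Phred 40
low_quality = "########"    # '#' encodes Phred 2
mixed_quality = "I#I#"      # mean Phred 21
```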


Somatic SNV and insertion or deletion calling 

Detection of somatic SNVs and insertions or deletions (indels) in WGS 
data was performed using our in-house pipeline Roddy (v.1.1.73). The 
pipeline is based on SAMtools (v.0.1.17-r973) mpileup and bcftools 
using parameter adjustments allowing for SNV calling. In addition
to those filtering steps, we applied additional filtering to remove
low-quality SNVs as described previously. Variants were excluded that
fell into ENCODE DAC blacklisted regions, Duke excluded regions,
the hiSeqDepthTopPt1Pct track from UCSC genome browser or variants
that had both the reference and altered allele annotated in dbSNP
(https://www.ncbi.nlm.nih.gov/SNP/, release 2 November 2015). In
addition, we applied filtering criteria restricting the overlap of variants 
with features including tandem repeats, simple repeats, low complex- 
ity, satellite repeats or segmental duplications to a maximum of two. 
Finally, all variants were filtered out that did not fulfil the heuristic 
criteria of having at least five reads in the sequenced tumour covering 
the position, a coverage of more than 10 reads but less than 300 reads 
in the germline control, more than 3% of the reads of the alternative 
variant in the germline, no bases other than wild type or variant at the 
position, at least one read on every strand supporting the variant (or 
more than five reads in total) and a variant allele frequency of
over 10%. The final results were annotated using ANNOVAR (release
1 February 2016) using data derived from Interpro
(https://www.ebi.ac.uk/interpro/, release 15 October 2015), the Combined
Annotation Dependent Depletion (CADD) phred score (ljb26, release
25 September 2014), cosmic (https://cancer.sanger.ac.uk/cosmic, release
11 September 2014), dbSNP and clinvar
(https://www.ncbi.nlm.nih.gov/clinvar/, release 2 March 2016).

For targeted sequencing, SNVs were detected using SAMtools mpileup
and bcftools using a similar method as described for WGS with a
couple of changes to increase stringency: variants were filtered out that
did not have a minimum read depth of 20, a minimum root-mean-square
mapping quality of 30, a minimum of three reads supporting the variant,
at least one read on every strand that supported the variant and an
allele frequency of at least 10%. In addition, variants were filtered out
that occurred in more than 0.1% of the 1000 Genomes Project population
(http://www.internationalgenome.org, release 24 August 2015)
or in more than 0.1% of the non-TCGA ExAC population
(http://exac.broadinstitute.org, release 23 April 2016). All reported coding SNVs
and coding indels detected using panel sequencing were annotated 
using ANNOVAR and benign variants were filtered out. 
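
The targeted-sequencing thresholds listed above can be expressed as a single predicate. The sketch below is a simplified illustration: the `Variant` record and its field names are hypothetical, and the annotation-based removal of benign variants is not modelled.

```python
from dataclasses import dataclass


@dataclass
class Variant:
    """Hypothetical per-variant summary; field names are illustrative."""
    depth: int             # total read depth at the position
    rms_mapq: float        # root-mean-square mapping quality
    alt_fwd: int           # variant-supporting reads on the forward strand
    alt_rev: int           # variant-supporting reads on the reverse strand
    freq_1000g: float      # frequency in the 1000 Genomes Project population
    freq_exac: float       # frequency in the non-TCGA ExAC population

    @property
    def alt_reads(self):
        return self.alt_fwd + self.alt_rev

    @property
    def allele_frequency(self):
        return self.alt_reads / self.depth if self.depth else 0.0


def passes_targeted_filters(v):
    """Apply the depth, mapping-quality, support, strand, allele-frequency
    and population-frequency thresholds stated in the text."""
    return (v.depth >= 20
            and v.rms_mapq >= 30
            and v.alt_reads >= 3
            and v.alt_fwd >= 1 and v.alt_rev >= 1
            and v.allele_frequency >= 0.10
            and v.freq_1000g <= 0.001
            and v.freq_exac <= 0.001)


good = Variant(depth=100, rms_mapq=60, alt_fwd=8, alt_rev=7,
               freq_1000g=0.0, freq_exac=0.0)
strand_biased = Variant(100, 60, 15, 0, 0.0, 0.0)
common = Variant(100, 60, 8, 7, 0.01, 0.0)
shallow = Variant(10, 60, 3, 3, 0.0, 0.0)
```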

For indel calling (<50 bp) in targeted sequencing both SAMtools 
mpileup and Platypus (v.0.8.1) were used as described previously.


The same criteria that were applied to SNVs were applied to indels,
including read depth, mapping quality, reads supporting the variant,
allele frequency, allele frequency in the general population, filtering of
benign variants and manual reviewing. Indel calling and SNV calling
in exome sequencing was performed in a similar manner as targeted
sequencing with an extra filtering step at alignment: as two germlines
were available, both were combined to form a pseudo-germline and this
pseudo-germline was used for filtering variants.


Germline SNV and indel calling 

Germline SNV and indel calling was performed as described previously 
using freebayes (https://github.com/ekg/freebayes, v.1.1.0) applying 
the same settings and using the same filtering criteria** using a panel 
of genes described in Supplementary Table 4. Genes not in the panel 
were excluded from analysis. In brief, raw variant predictions were 
filtered (QUAL >20, QUAL/AO >2, SAF >1, SAR >1, RPR >1 and RPL >1), 
and normalized across patients with vt (https://genome.sph.umich.edu/wiki/Vt,
version 0.5).
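
The raw-variant filter on freebayes output can be illustrated as follows. This is a sketch under stated assumptions: it reads the QUAL column and the AO, SAF, SAR, RPR and RPL keys from the INFO column of a VCF record, it assumes biallelic records (one value per key), and the function names are hypothetical.

```python
def parse_info(info: str) -> dict:
    """Parse a VCF INFO string such as 'AO=12;SAF=5' into a dict of floats."""
    fields = {}
    for item in info.split(";"):
        if "=" in item:
            key, value = item.split("=", 1)
            try:
                # Assumption: biallelic record, so the value is a single number.
                fields[key] = float(value)
            except ValueError:
                pass  # skip non-numeric or multi-allelic values
    return fields


def passes_germline_filters(qual: float, info: str) -> bool:
    """Apply QUAL >20, QUAL/AO >2, SAF >1, SAR >1, RPR >1 and RPL >1."""
    f = parse_info(info)
    required = ("AO", "SAF", "SAR", "RPR", "RPL")
    if any(key not in f for key in required) or f["AO"] <= 0:
        return False
    return (
        qual > 20
        and qual / f["AO"] > 2
        and f["SAF"] > 1 and f["SAR"] > 1
        and f["RPR"] > 1 and f["RPL"] > 1
    )
```

Normalization across patients with vt is a separate step and is not part of this sketch.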

Putative germline mutations were defined as frameshift, stop gain/ 
loss, start loss, canonical splice site, exon/gene deletions, and damag- 
ing non-canonical splice site variants or pathogenic missense variants 
based on ClinVar annotations. In addition, homozygous missense vari- 
ants, or missense variants having a somatic second hit that are present 
in a functional domain with a CADD phred score higher than 20 were
annotated as putative germline mutations. 

Putative germline mutations were filtered out if the minor allele 
frequency was higher than 0.1% in any continental population based 
on the non-TCGA ExAC database, the 1000 Genomes Project or ESP 
(http://evs.gs.washington.edu/EVS/). 

After annotation, filtered germline mutations were excluded from 
the analysis if annotated as benign in ClinVar. In addition, variants 
with a reduction in allele frequency to less than 40% in the tumour 
were excluded. Finally, we only retained variants that were recurrent 
within the cohort. The context of all remaining variants was reviewed 
manually using IGV to prevent false positives. 
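
The population-frequency, tumour allele-frequency and recurrence filters for germline candidates can be sketched as below. The function names and the population keys are hypothetical, and treating "recurrent within the cohort" as "the same variant seen in at least two patients" is an assumption made for illustration; the text does not state the exact definition.

```python
def passes_population_filter(maf_by_population, max_maf=0.001):
    """True if no continental population exceeds a minor allele frequency of 0.1%.

    Keys (for example 'ExAC_NFE') are illustrative; None means 'not reported'.
    """
    return all(maf is None or maf <= max_maf
               for maf in maf_by_population.values())


def passes_tumour_af_filter(tumour_af, min_af=0.40):
    """Drop variants whose allele frequency falls below 40% in the tumour."""
    return tumour_af >= min_af


def recurrent_variants(calls):
    """Return variant ids seen in at least two patients (assumed definition).

    `calls` is an iterable of (patient_id, variant_id) pairs.
    """
    patients_by_variant = {}
    for patient, variant in calls:
        patients_by_variant.setdefault(variant, set()).add(patient)
    return {v for v, patients in patients_by_variant.items()
            if len(patients) >= 2}
```

ClinVar-based exclusion and manual review in IGV are outside the scope of this sketch.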

For investigation of non-coding regions, we included regions 100 bp 
preceding the transcription start site (based on known protein-cod- 
ing genes defined in Gencode v.19) and putative enhancers (based 
on H3K27ac data from cultured H9 neural progenitor cells, Encode, 
ENCSR449AXO, Bernstein laboratory)”. 


SV discovery using paired-end sequencing data 

For SV discovery, aligned reads were processed with Delly (v.0.7.5) using 
paired-end mapping and split-read analysis”. Centromeric and telom- 
eric regions of hg19 were excluded from the analysis and, to maximize 
specificity for the latter, somatic filtering was run jointly on each pair 
of tumour and matched control samples. For germline SV discovery, 
we merged all SVs using the Delly merge command and genotyped all 
SV sites in all control samples. We then discarded SVs with an allele 
count of zero and filtered SVs using the germline mode of the Delly 
filter command. All germline SVs were then intersected with genes 
and protein-truncating variants, such as the DICER1 germline exon 19
deletion, were manually inspected using IGV. 

For somatic SV discovery, we first filtered each tumour sample 
against the matched control using the somatic mode of the Delly fil- 
ter command. We then aggregated all somatic variants and genotyped 
them once more across all samples to fetch putative mismapping arte- 
facts of high allele count. All somatic SVs that passed the second round 
of filtering against a panel of normal samples were collected for each 
tumour sample and overlaid on read-depth plots for manual inspection. 


Mate pair sequencing 

Mate pair DNA library preparation was carried out using the Illumina 
MP v.2 reagents and protocol as described previously”. In brief, 
fragmentation of genomic DNA was performed using a Hydroshear
device to an insert size of 4.5 kb followed by sequencing with Illumina
HiSeq 2000 instruments. Alignment was performed using Eland (v.2) 
retaining only uniquely aligned reads for downstream rearrangement 
analysis using Delly. Mate pair sequencing was applied on complex 
rearrangements detected using WGS mainly as validation and recon- 
struction of the rearrangements but was not applied for discovery of 
structural alterations. 


Oncoplots 

Oncoplots were generated using custom scripts. Events (either ger- 
mline or somatic) were only included if the minor allele frequency 
in the non-TCGA ExAC population was under 1% or unknown and the 
variant allele frequency was higher than 20%. In addition, every event 
had to fulfil one of the following criteria: the gene was either recur- 
rently affected with at least one event present in the WGS cohort or a 
loss-of-function mutation within a recurrently affected pathway. Events 
were only included in the figure if events occurred in ETMRs without 
C19MC amplification, in multiple ETMRs with C19MC amplification or 
were loss-of-function mutations in deregulated pathways. For the CNA 
oncoplot, all CNAs were included after manual filtering for CNAs that 
were probably the result of increased ploidy. 


Conservation between primary and relapse tumours 
Conservation of events between primary tumours and recurrences was 
performed using somatic calls of SNVs, indels and SVs. Variant allele 
frequencies were recalculated at the defined positions using SAMtools 
mpileup followed by Varscan pileup2cns (v.2.3.9)*°. Variants were only 
included if the coverage in both germline and tumour was at least 10 
reads, the coverage in both primary and recurrence was at least 10 reads, 
and no alleles other than wild type or variant were detected in the
germline, primary tumour or recurrence. In all sample combinations, more
than 95% of SNVs had sufficient reads for further analysis. Retained
events were defined as events that had an allele frequency over 20% in
both primary and recurrence; gained mutations in recurrences were
defined as mutations that had an allele frequency less than 2% in the
primary tumour and more than 10% in the recurrence; and events that
were lost in recurrence were defined as events with allele frequencies
over 10% in primary tumours but less than 2% in recurrences.
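
The coverage requirement and the three event classes defined above can be written out as a short sketch. The function names and the "unclassified" label for events that match none of the three definitions are assumptions for illustration; the thresholds are those given in the text.

```python
def has_sufficient_coverage(germline, primary, recurrence, min_reads=10):
    """At least 10 reads in the germline, the primary tumour and the recurrence."""
    return min(germline, primary, recurrence) >= min_reads


def classify_event(af_primary, af_recurrence):
    """Classify an event from its allele frequencies (given as fractions, 0-1)."""
    if af_primary > 0.20 and af_recurrence > 0.20:
        return "retained"
    if af_primary < 0.02 and af_recurrence > 0.10:
        return "gained"
    if af_primary > 0.10 and af_recurrence < 0.02:
        return "lost"
    return "unclassified"  # label assumed; the text defines only three classes
```

For example, an event at 1% in the primary tumour and 15% in the recurrence would be labelled as gained, whereas an event at 5% in both samples falls outside all three definitions.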


Calculation of mutational signatures 
Mutational signatures were calculated as described previously and 
compared to 30 previously published signatures“ and the recently 
described P1 signature”. In brief, somatic mutations and mutation 
context (adjacent bases) were extracted from the called SNVs and used 
to construct a catalogue of every possible context for all samples. We 
calculated the exposures using the formula M = P × E, where M is the
mutational catalogue, P is the mutational signature and E is exposures
(contribution of mutational process within the mutational landscape).
Signatures were calculated for both primary tumours and recur- 
rences with a mutation count over 200. Whole exomes and samples 
sequenced using targeted sequencing were therefore excluded from 
analysis. Exposures present under 5% in a sample were removed and 
exposures over 5% were reconstructed to form 100% of the sample to 
reduce the amount of noise. Changes in mutational signatures between 
primary tumours and recurrences were also calculated using only the 
30 previously published signatures“ (data not shown). Signature cal- 
culation was also performed de novo and five signatures were called. 
Compared to the 30 previously published signatures, one novel signa- 
ture was found. This novel signature had a cosine similarity over 0.85 
only for signature P1, therefore the P1 signature was included in the 
analysis instead of only the 30 classical signatures. 
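
Two of the steps above, the removal of exposures under 5% with rescaling of the remainder to 100% and the cosine-similarity comparison of signatures, can be sketched in plain Python. This is an illustration only: the function names are hypothetical, the decomposition M = P × E itself is not implemented here, and treating an exposure of exactly 5% as retained is an assumption.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two signature vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def renormalize_exposures(exposures, min_fraction=0.05):
    """Drop signatures contributing under 5% and rescale the rest to sum to 1.

    `exposures` maps a signature name to its (unnormalized) exposure.
    """
    total = sum(exposures.values())
    if total == 0:
        return {}
    fractions = {name: value / total for name, value in exposures.items()}
    kept = {name: f for name, f in fractions.items() if f >= min_fraction}
    kept_total = sum(kept.values())
    return {name: f / kept_total for name, f in kept.items()}
```

With this sketch, a de novo signature would be matched to a published one when `cosine_similarity` exceeds 0.85, which is the cut-off stated above.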


Calculation of transcriptional asymmetry 
Transcriptional asymmetry was calculated using the MutationalPat- 
terns (v.1.4.3) R package based on previously described methods”. 


Mutations were considered that overlapped gene bodies and were 
combined from either all primary tumours or all relapses. Substitu- 
tions were assigned as transcribed when present on the same strand 
as the gene definition (UCSC, hg19) and untranscribed if present on 
the opposite strand compared to the gene definition. When multiple 
genes in different directions overlapped, events were excluded in this 
region. Events were split in different substitution types and asymmetry 
was calculated separately for every substitution type for both primary 
and relapse tumour samples. 
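
The strand assignment rule can be illustrated with a small helper. This is a sketch, not the MutationalPatterns implementation: the function name is hypothetical, and how the strand of a substitution is determined is assumed to be decided upstream.

```python
from typing import Optional, Sequence


def assign_transcription_strand(mutation_strand: str,
                                gene_strands: Sequence[str]) -> Optional[str]:
    """Label a substitution relative to the overlapping gene definition(s).

    Returns None when the substitution lies outside gene bodies or when
    overlapping genes run in different directions (such events are excluded).
    """
    strands = set(gene_strands)
    if len(strands) != 1:
        return None
    (gene_strand,) = strands
    return "transcribed" if mutation_strand == gene_strand else "untranscribed"
```

Counts of transcribed and untranscribed events would then be tallied per substitution type, separately for primary and relapse samples.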


RNA profiling 

RNA was isolated from 28 ETMRs using the Qiagen RNeasy Mini kit or 
the Maxwell RSC simplyRNA Tissue Kit (AS1340) using homogenized 
tumour tissue. All RNA isolations were performed according to the 
manufacturer’s protocol. Quality control was performed using an Agi- 
lent Bioanalyzer 2100 instrument and samples with an RNA integrity 
index over 7 were included. Samples were profiled using Affymetrix 
GeneChip Human Genome U133 Plus2.0 arrays at the DKFZ Genomics 
and Proteomics Core Facility applying the manufacturer’s protocol for 
preparation, hybridization and quality control. 


Analysis of mRNA data 

Samples were normalized and probe detection P values were calculated 
using the MAS5.0 algorithm (Affymetrix), followed by quality control
of the percentage of present calls and manual inspection of GAPDH 
levels and 5’ to 3’ ratios. All probes were filtered to contain every gene 
once in the analysis. Analysis of atypical teratoid rhabdoid tumours 
(ATRT)*, medulloblastoma (MB)” and CNS-PNET° data was performed 
in the same way for every sample included in the analysis.

Differential expression was calculated using a two-way ANOVA using 
a P value of 0.01 as a cut-off (corrected using false-discovery rate (FDR)).
In addition, an absolute fold change of 2 and a minimum log2-transformed
gene expression value of 5 in at least one of the compared sets
were used as criteria for the definition of differentially expressed genes. 
Heat maps were generated using supervised clustering of expression of 
450 DNA repair genes shown in (Supplementary Table 4) and pathways 
described previously’. All samples were z-score normalized and clustering
was performed using Euclidean distance. All mRNA expression data
were analysed using the R2 platform (http://R2.amc.nl). 
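
The three criteria for differentially expressed genes can be sketched as follows. This is an illustration under assumptions: the ANOVA itself is not implemented, the FDR correction is shown as a Benjamini-Hochberg adjustment, group expression is summarized by a mean log2 value per set, and the function names are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def is_differential(adjusted_p, mean_log2_a, mean_log2_b):
    """FDR-adjusted P < 0.01, absolute fold change >= 2, log2 expression >= 5."""
    return (
        adjusted_p < 0.01
        and abs(mean_log2_a - mean_log2_b) >= 1.0  # fold change of 2 on log2 scale
        and max(mean_log2_a, mean_log2_b) >= 5.0   # in at least one compared set
    )
```

A gene with mean log2 expression of 7.5 in one set and 5.0 in the other and an adjusted P value of 0.001 would pass all three criteria.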

t-SNE clustering was performed by using z-score-normalized gene 
expression using only representative probes by applying HugoOnce 
(http://R2.amc.nl). Gene expression data were derived from a set of 
580 samples deposited in the R2 platform and 28 ETMRs, which were 
all processed in the same way. Using these samples, a 50-dimension 
principal component analysis was performed followed by t-SNE clus- 
tering using RTSNE. 


GO and KEGG enrichment analysis 

GO-term enrichment and KEGG pathway enrichment was performed 
using TOPPgene™ and filtered using significant terms also found using 
the DAVID algorithm (v.6.8)**. For all terms a P value cut-off of 0.05 
was used and P values were corrected using FDR, Bonferroni and Ben- 
jamini-Hochberg corrections. For GO term enrichment a gene limit 
of 200 was chosen to obtain more specificity in the results. Processes 
were defined using similarity between GO terms that was calculated 
using NaviGO® using a minimum co-occurrence score of 0.05. For terms
without similarity to any significant terms this was manually assessed. 


Estimating cell populations from bulk mRNA sequencing 

The Cibersort algorithm (v.1.06)° was applied to estimate the relative 
frequencies of different cell populations using normalized expres- 
sion data. Signature genes were derived from the median gene expres- 
sion levels of defined cell populations from the prefrontal cortex as 
described previously’®. Only informative genes were selected that had a
median expression higher than 1 in any of the subgroups. Both relative
and absolute mode were applied using 500 permutations to estimate
the relative abundance of each sample type. 


Microdissection of FFPE material 

As a representative FFPE tissue sample, ET174 was histologically identified,
targeted and microdissected with a puncher for nucleic acid
extraction. DNA was extracted in a similar manner as described for
other FFPE material; RNA was extracted using the automated Maxwell
system with the Maxwell 16 LEV RNA FFPE Kit (Promega), according to
the manufacturer’s instructions. To evaluate FFPE RNA quality, we used
the percentage of RNA fragments >200 nt (fragment determination
value, DV200). Only RNA samples with DV200 > 70% were included for
sequencing on a NextSeq 500 (Illumina). RNA-sequencing data were 
quantified using the quant option of kallisto (v.0.43.0) using standard 
settings®. 


miRNA sequencing and processing 

Small RNAs were isolated as described previously” from fresh-frozen 
tumour material. In brief, total RNA was extracted using guanidinium 
isothiocyanate/phenol extraction followed by 3’-adaptor ligation of 
barcoded adenylated adaptors. Samples were pooled in two sets of five
samples. Subsequently, small RNAs (19-35 nt) were isolated using gel
electrophoresis and purified using ethanol precipitation. Fragments
were then amplified using standard PCR, isolated using gel electropho- 
resis and purified using ethanol precipitation. Samples were sequenced 
on a HiSeq 2000 v.4 machine.

Small RNA-sequencing data were aligned using Bowtie v.1.00 (--seedmms 1
--maqerr 1000 --seedlen 21 --norc -M 1 --best --strata) to mirbase
18 after reads were selected by cutting off adapters using cutadapt v.9.5
(-e 0.1 -m 18 -M 34 -O 8 TCGTATGCCGTCTTCTGCTTG) and taking reads
between 18 and 34 nucleotides. Subsequently, reads were counted 
using SAMtools 0.1.17-r973 mpileup. 


Analysis of miRNA data 
Mature miRNA counts were quantified relative to the total read count
separately for both the 5p and 3p strand and the resulting reads per 
million (RPM) values were used in all subsequent analysis. Supervised 
clustering of miRNAs was performed using only significantly differen- 
tially expressed miRNAs between ETMRs with C19MC amplification and 
other samples (excluding ETMRs without C19MC amplification) using 
adjusted negative binomial testing with the DESeq2 package (v.1.18.1)”. 
Samples were normalized using z-score normalization and clustered 
using hierarchical clustering using average as method. Unsupervised 
clustering was performed after filtering for miRNAs that had an expres- 
sion over 32 RPM in any included sample. Samples were normalized and 
clustered in the same way as the supervised analysis. 
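
The reads-per-million normalization and the expression filter used before unsupervised clustering can be sketched as below. The function names and input layout are hypothetical; the 32 RPM threshold is taken from the text.

```python
def reads_per_million(counts, total_reads):
    """Scale raw mature-miRNA counts to reads per million (RPM).

    `counts` maps a miRNA name (5p and 3p strands kept separate) to its
    raw count; `total_reads` is the total read count of the sample.
    """
    if total_reads <= 0:
        return {name: 0.0 for name in counts}
    return {name: count * 1_000_000 / total_reads
            for name, count in counts.items()}


def expressed_mirnas(rpm_by_sample, threshold=32.0):
    """Return miRNAs whose expression is over the threshold in any sample.

    `rpm_by_sample` maps a sample id to a {miRNA: RPM} dict.
    """
    kept = set()
    for rpm in rpm_by_sample.values():
        kept.update(name for name, value in rpm.items() if value > threshold)
    return kept
```

The retained miRNAs would then be z-score normalized and clustered hierarchically, which is not shown here.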

miRNA processing was quantified by comparing the normalized 
levels of 3p mature miRNAs to the levels of 5p mature miRNAs using
sequences provided by miRbase 18°. For every sample, RPM levels 
were compared for each 5p versus 3p miRNA and the mean value was 
calculated as the processing ratio. miRNAs with multiple 5p or 3p vari- 
ants were averaged. All miRNA data were compared against a set of 10 
random samples per entity derived from unpublished ICGC data (C. 
Aichmuller et al., manuscript in preparation) that were processed in 
the same way as the ETMR data. 


Analysis of R-loop levels 

DNA-RNA hybrids were extracted from tissue derived from ETMR 
patient-derived xenograft (PDX) models (BT183) that were treated 
using topotecan or saline as described previously”. Tumours were 
subsequently frozen and pelleted using ultracentrifugation. DNA-RNA 
hybrids were extracted as described previously using the same protocol 
that is applied for cultured cells”. DNA was extracted using proteinase 
K followed by phenol-chloroform extraction and ethanol precipitation. 
Subsequently the DNA was fragmented using the restriction enzymes
HindIII, EcoRI, BsrGI, XbaI and SspI (New England Biolabs). Digested
DNA was subsequently incubated with the anti-DNA-RNA hybrid anti- 
body S9.6 (Merck, MABE1095) and immunoprecipitated using agarose 
beads. Bound DNA-RNA hybrids were eluted and incubated with pro- 
teinase K and cleaned with an additional phenol-chloroform-ethanol 
extraction. The DNA was subsequently sonicated and sequenced using 
a HiSeq 2000 machine with a 50-bp single-read protocol. Each treatment
condition was performed in duplicate and both RNase H and the
input were included as negative controls.


DRIP-seq analysis 

DRIP-seq data were aligned to hg19 using BWA using standard settings. 
Aligned reads were normalized to the input signal using bamCompare 
(deeptools 3.0.2) using log2 fold change increase as output for visualization
purposes”. Peak calling was performed using MACS v.2.0
(https://github.com/taoliu/MACS) using the settings -g 2.7e9 -q 0.05 -B -m
2 30 and the input signal as background. Peak files were combined
for each condition and each subsequent analysis was performed for 
both topotecan-treated samples and untreated samples. Peak calling 
was compared to data from Ewing sarcoma” and correlated to mRNA 
expression to ensure the quality of the dataset. 

RLFS were predicted using the qmRLFS finder (v.1.5) tool”° by applying
standard settings and using models m1 and m2 for both the human
genome (hg19) and mouse genome (mm10) depending on the analy- 
sis. Both the R-loop initiation zone and R-loop elongation zone were 
included in RLFS peaks. 


Calculating R-loop enrichments 

For the analysis comparing the overlap of SVs with R-loops, overlapping 
SVs were defined as either the start position of the SV, the end position 
of the SV or both positions falling in either an RLFS or DRIP peak depend- 
ing on the analysis. Duplicate breakpoints were excluded when over- 
lapping between matched primary tumour and relapse. The analysis 
comparing breakpoints in R-loops against the rest of the genome was 
performed by taking the total number of SVs falling in R-loop regions 
and by comparing this to 10,000 randomly generated regions of the 
same size, with the same number of regions as the R-loop regions. For 
the comparison against other tumour entities 68,018 somatic rear- 
rangements were used from a previously published study” and loca- 
tions were compared to either RLFS or R-loops derived from ETMRs. 
P values were calculated using Fisher’s exact tests and corrected for 
multiple testing using Benjamini-Hochberg correction. The same 
10,000 random regions were used for each entity and a P value was 
calculated using Fisher’s exact test for every iteration and adjusted 
using Bonferroni correction. Only entities with more than 50 break- 
points were used for the comparison. For comparisons of R-loops with 
genomic regions, ranges from both non-B-DB 2.0” and RepeatMasker
4.0.8 (http://www.repeatmasker.org/) were aggregated and the median
enrichment over the input signal was calculated for 10,000 randomly 
selected regions of every DNA element. 
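
One iteration of the comparison of breakpoints in R-loop regions against size-matched random regions can be sketched as below. This is a simplified illustration under several assumptions: the genome is treated as a single linear coordinate space, the Fisher's exact test is implemented one-sided (the text does not state the sidedness for this analysis), and all function names are hypothetical.

```python
import math
import random


def count_in_regions(positions, regions):
    """Count positions that fall inside any (start, end) region, end exclusive."""
    return sum(1 for p in positions if any(s <= p < e for s, e in regions))


def random_regions(sizes, genome_length, rng):
    """Draw one random region per input size, fully inside the genome."""
    regions = []
    for size in sizes:
        start = rng.randrange(0, genome_length - size + 1)
        regions.append((start, start + size))
    return regions


def fisher_greater(a, b, c, d):
    """One-sided Fisher's exact test, P(X >= a), for the table [[a, b], [c, d]]."""
    row1, col1, n = a + b, a + c, a + b + c + d
    upper = min(row1, col1)
    tail = sum(math.comb(row1, k) * math.comb(n - row1, col1 - k)
               for k in range(a, upper + 1))
    return tail / math.comb(n, col1)


def enrichment_p_value(breakpoints, regions, genome_length, seed=0):
    """Compare breakpoints in the given regions with one random, size-matched set."""
    rng = random.Random(seed)
    sizes = [end - start for start, end in regions]
    observed = count_in_regions(breakpoints, regions)
    background = count_in_regions(
        breakpoints, random_regions(sizes, genome_length, rng))
    n = len(breakpoints)
    return fisher_greater(observed, n - observed, background, n - background)
```

Repeating `enrichment_p_value` with different seeds and adjusting the resulting P values for multiple testing would approximate the iterated procedure described above.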


FISH analysis 

FISH was applied as described previously to validate C19MC amplifica- 
tion®. In brief, two probes corresponding to the 19q13.42 and 19q13 
loci were applied, using the 19q13 probe as a reference. The minichro- 
mosome formed between chromosome 11 and 13 was validated using 
probes at the loci 11q22.2 and 13q31.3. Ploidy of 28 different ETMRs was 
validated in a similar manner using FISH probes at the chromosome 9 
and chromosome 11 centromeres as those chromosomes were found 
to be relatively stable in our cohort. 


Detection of R-loop levels 

DNA-RNA hybrids were extracted from cell pellets of Dicer wild-type 
and knockout mouse cells after lysis for 12-24 h, with a solution containing
TE, SDS and proteinase K as described previously®*. DNA-RNA
hybrids were subsequently extracted using phenol-chloroform isoamyl
alcohol (Affymetrix, 75831) followed by phase-lock gel separation
(Quantabio, 5PRIME Phase Lock Gel Light, 2302820). DNA-RNA hybrids
were subsequently purified using ethanol precipitation. DNA-RNA
hybrids were diluted in 10x SSC buffer and loaded on a charged nylon
membrane (Roche, 11209299001). Membranes were blocked in 1x TBS
containing 5% skimmed milk and incubated overnight using an anti- 
DNA-RNA hybrid (S9.6) antibody (Merck, MABE1095) at a concen- 
tration of 1:1,000. Subsequently an HRP-linked secondary antibody 
(Santa Cruz, sc-2005) was used followed by incubation with ECL (GE 
Healthcare, RPN2232). Methylene blue staining (Sigma-Aldrich, M9140) 
supplemented with 0.3 mM NaOAc was used as a loading control for 
all dot blots. Dot blots were developed using a chemiluminescence
imaging system (Intas, ECL chemostar). Experiments involving dot 
blots were performed five times to ensure reproducibility. 

R-loop stainings for tumour material were performed on formalin- 
fixed tissue slides of five ETMRs, five group 4 medulloblastomas and 
five WNT medulloblastomas using the same conditions as described 
previously”. For all immunohistochemistry experiments, representa- 
tive photographs of the slides were taken in addition to photographs 
covering the entire slide. 


In vitro drug response 

Response to treatment with pamiparib (BGB-290) (MedChemExpress, 
HY-104044) or veliparib (Abbvie, S1004) and topotecan (ApexBio,
B4982) was evaluated using dose-response curves. BT183 cells were
plated in 96-well plates 24 h before treatment. Twofold increasing
concentrations of the inhibitor were added with concentrations ranging
from 0.08 nM to 20 nM for topotecan and 156.25 nM to 40 µM for
pamiparib and veliparib. Cell viability was measured 72 h after treatment
using an automated plate reader after using CellTiter-Glo (Promega,
G7570). GraphPad Prism was used to generate dose-response
curves. For combination treatment, the IC50 was determined using
the dose-response curves of single inhibitors and added to increasing
concentrations of the other drug. Synergism between inhibitors
was calculated by applying the Chou-Talalay method as described
previously®. 
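
The twofold dilution series and the combination index used to assess synergism can be sketched as follows. This is an illustration, not the analysis code of the study: the function names are hypothetical, the lowest topotecan concentration is taken as 0.078125 nM (reported rounded as 0.08 nM), and the median-effect parameters (the median-effect dose Dm and slope m of each single drug) are assumed to have been fitted beforehand.

```python
def twofold_series(lowest, highest):
    """Return a twofold increasing concentration series from lowest to highest."""
    series = [lowest]
    # Small tolerance so that the top concentration is included despite rounding.
    while series[-1] * 2 <= highest * (1 + 1e-9):
        series.append(series[-1] * 2)
    return series


def dose_for_effect(fa, dm, m):
    """Median-effect equation: single-drug dose producing fraction affected fa."""
    return dm * (fa / (1.0 - fa)) ** (1.0 / m)


def combination_index(d1, d2, fa, dm1, m1, dm2, m2):
    """Combination index for doses d1 and d2 that jointly give effect fa.

    Values below 1 suggest synergism, 1 additivity, above 1 antagonism.
    """
    return d1 / dose_for_effect(fa, dm1, m1) + d2 / dose_for_effect(fa, dm2, m2)
```

For the topotecan range, `twofold_series(0.078125, 20.0)` yields nine concentrations ending at 20 nM.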


Immunofluorescence 

BT-183 cells were grown on sterilized glass coverslips coated with 
human laminin (Sigma, L2020) and poly-L-lysine (Sigma, P8920) for 
24 h before treatment with DMSO (0.6%), topotecan, pamiparib or both
pamiparib and topotecan. Before fixation using 4% PFA, spheres were 
washed with 0.5% NP-40 for 5 min on ice to increase the permeability. 
Subsequently, cells were permeabilized using 0.3% Triton X-100 for 
10 min followed by 1 h blocking with 10% donkey serum. Slides were
incubated with primary antibody 1:100 for γ-H2AX (Abcam, ab11174)
or S9.6 in 10% donkey serum overnight. Subsequently, coverslips were
incubated with DAPI (1:1,000) and Alexa Fluor 488/568 (1:500) (Life 
Technologies). Coverslips were mounted on glass slides using mount- 
ant (Invitrogen, P36930) and imaged using a confocal microscope 
(Zeiss LSM 800). Representative pictures were taken at 400x mag- 
nification and processed using Airyscan processing (included in the 
Zeiss LSM 800 software) and applying the middle of 21 z-stacks. Red 
and green channels were normalized to the blue channel (DAPI) in all 
shown pictures. 

Signals were quantified using ImageJ, applying a custom-made macro
over three random neurospheres of seven z-stacks each for every treat- 
ment condition. Signal intensity was measured in separate regions 
by binarizing DAPI signal and using watershed image segmentation 
to generate regions that simulate cells within the neurosphere. Sub- 
sequently, background subtraction (rolling ball radius of 50 pixels) 
was applied on every z-stack for every channel. For each channel the 
average signal per cell was calculated and the γ-H2AX signal and S9.6
signal were normalized to DAPI signal per cell.


Dicer1 wild-type and Dicer1 knockout mouse cells were grown and 
mounted in the same way but were not grown on coated coverslips as 
cells were adherent. Representative images were taken at 630x magnification
using the middle of 11 z-stacks. Quantification was performed
at 200x magnification using three representative regions with more
than five cells using five different z-stacks. Background correction, 
signal calculation and normalization was performed in the same man- 
ner as for the neurospheres. 


Statistics and reproducibility 

Statistical enrichment was calculated using Fisher’s exact tests for 
calculations of binary values and proportions unless stated otherwise. 
For comparison between continuous values Mann-Whitney U-tests 
(also known as Wilcoxon rank-sum tests) were used unless otherwise 
stated in either the figure legend or Methods section. Multiple testing 
correction was performed using the Benjamini-Hochberg method 
unless otherwise stated in the figure legend. All statistical analysis was 
performed in R (v.3.4.3) except for dose-response curves, which were
calculated using GraphPad Prism 6. Unless stated otherwise, representative
experiments were performed at least three times.
Pictures shown in Fig. 4f, g are representative of three different regions
each containing more than three cells; the experiment is representative
of three biological replicates. Dot blots in Fig. 4h are representative of
five biological replicates. Dose-response curves in Fig. 5a were based
on three biological replicates for each concentration of each drug and
combination of drugs. The experiment was repeated three times and
a representative curve is shown for each drug combination. Images
shown in Fig. 5c, d are representative of three different regions each
containing more than three cells; the experiment is representative of three
biological replicates.


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Raw and processed 450K/EPIC methylation values, and raw and pro- 
cessed expression data for all included ETMRs are deposited at the 
Gene Expression Omnibus (GEO) under accession number GSE122038. 
All NGS data are deposited at the European Genome-phenome Archive 
(EGA) under accession number EGAS00001003256. Source Data for
Figs. 1a-c, 2c, 3b, c, 4d, g, 5a, b, d and Extended Data Figs. 1a, 2a-g, 4b,
c, h, 5c, 6b, c, 8a-d, 9b, c, e-g, 10g are provided with the paper.


Code availability 


All custom code used to generate the data in this study is available 
upon reasonable request. 
(position of the circle), histological diagnosis (outer ring) and C19MC status on the site of occurrence was available.
Extended Data Fig. 2 | miRNA expression correlates strongly between
ETMRs with or without C19MC amplification. a, Supervised clustering of the
416 differentially expressed mature miRNAs (two-sided negative binomial,
Benjamini-Hochberg-adjusted P < 0.05) between ETMRs (n = 7) (excluding
ETMRs without amplification) and other tissues (n = 38). b, Unsupervised
clustering of mature miRNAs with a minimum expression of 32 in at least one
sample and a variance higher than 10 between all samples (n = 294).
Hierarchical clustering using average as distance measure was used to cluster
the samples after values were z-score normalized. c-g, Regression of the
median expression of mature miRNAs derived from ETMRs (n = 7) against
normal brain (n = 8), other entities (n = 10 for all entities) or ETMRs without
C19MC amplification (n = 3). miRNAs that had a median expression below 32
RPM in either of the compared entities were excluded. miRNAs that were
differentially expressed between ETMRs (with and without C19MC
amplification) against other entities (two-sided negative binomial,
Benjamini-Hochberg-adjusted P < 0.05) are highlighted. For each comparison, the
Pearson correlation was calculated (P < 0.0005 for all comparisons).
Extended Data Fig. 3 | KEGG pathway enrichment in ETMRs. a, b, Summary of
KEGG pathway enrichment of ETMRs (n = 28) against healthy brain tissues
(n = 38) (a) or 580 different brain tumours (b). Pathways are coloured by
similarity based on NaviGO co-occurrence scores® and manual assessment.
Significantly upregulated genes were calculated using ANOVA (FDR-adjusted
P < 0.01).
Extended Data Fig. 4 | ETMRs consist of at least two distinct subtypes of
cells. a, Heat map showing z-score-normalized expression of 450 DNA repair
genes and the corresponding pathways® for 190 tumours of different entities
including 28 ETMRs. Supervised clustering was used and samples were sorted
by entity or C19MC amplification status. Entities include three ATRT
subgroups, four MB subgroups, central nervous system Ewing sarcoma family
tumour with CIC alteration (CNS EFT-CIC), central nervous system
neuroblastoma with FOXR2 activation (CNS-NB FOXR2), central nervous
system high-grade neuroepithelial tumour with MN1 alteration (HGNET-MN1),
central nervous system high-grade neuroepithelial tumour with BCOR
alteration (HGNET-BCOR), ETMRs with amplification of C19MC (red) and
ETMRs without amplification of C19MC (blue). ETMR subsets were manually
assessed based on DNA repair pathway expression. b, Debulking of mRNA
expression using CIBERsort by using the median expression of single-cell
RNA-sequencing data of the forebrain as gene signature”. The cumulative fraction
of each cell type was calculated and samples were sorted according to the
percentage of modelled neural stem cells. Samples were annotated based on
the subsets derived from a. c, Box plots showing expression of stem cell
markers (HMGA2, LIN28A), astrocyte markers (AQP4, GFAP) and genes involved
in the DNA damage response (WEE1, CHEK2) in ETMRs with high DNA repair
expression (n = 18) and low DNA repair expression (n = 10). P values were
calculated using a two-sided Mann-Whitney U-test; ***P < 0.0005, **P < 0.005,
*P < 0.05; NS, not significant. Boxes show the median, first and third quartile,
and whiskers extend to 1.5x the interquartile range. d, Distribution of histology
annotation of 18 ETMRs for which these data were available divided into two
subsets. The number of EBL phenotypes was significantly enriched in the high
DNA repair expression group using a two-sided Fisher’s exact test
(P=3.7 x10”). e, t-SNE clustering based on methylation profiles of a
microdissected ETMR (ET174) (split in bulk, rosettes and neuropil) and 192
other ETMRs. f, Expression of LIN28A and AQP4 in rosette tissue and neuropil
tissue of the same tumour. g, Copy-number profiles of microdissected neuropil
and rosettes from the same tumour. h, Fold change in expression of six markers
in two matched recurrences normalized to the primary tumour.
Extended Data Fig. 5 | Recurrent events in ETMRs without C19MC
amplification. a, Schematic representation of the translocation and
amplification of a region on chromosome 11 with the host gene of the miR-17-92
miRNA cluster (also known as MIR17HG) shown in red on chromosome 13.
Regions were reconstructed using mate pair sequencing. The actual amplified
region is circular, denoted by arrows on each end. b, Copy-number profile of a
tumour containing the miR-17-92 cluster translocation and amplification. Copy
numbers were derived from methylation array data with each dot representing
a probe. Inset shows validation of both the chromosome 11 (YAP1; green) and
chromosome 13 (MIR17HG; red) amplifications using FISH. c, Quantification of
mature miRNAs in the miR-17-92 miRNA cluster (n = 20) confirms that the ETMR
(blue) with the chromosome 11 and chromosome 13 amplification and
translocation has higher expression of miR-17-92 cluster miRNAs. Each bar
represents one tumour corresponding to the given entity. P values were
calculated using a one-sided Mann-Whitney U-test; *P < 0.05. d, Example of a
copy-number profile of a case showing clustered rearrangements around
C19MC. This tumour did not have a C19MC amplification or DICER1 mutations.
e, Copy-number profile of an ETMR without C19MC amplification or DICER1
mutation showing an overall unstable genome with many regions containing
clustered breakpoints.
Extended Data Fig. 6 | ETMRs recurrently show genomic instability.
a, Oncoplot showing the co-occurrence of all CNAs separated by C19MC
amplification status. b, Overview of copy-number profiles of all ETMRs
(n = 193). Bars (gain, balanced and loss) add up to 100% for each chromosome
arm. c, Overview of copy-number profiles of all ETMRs with (n = 170) or without
(n = 23) C19MC amplification. P values were calculated using two-sided Fisher’s
exact tests and adjusted for multiple testing (Benjamini-Hochberg
correction); ***P < 0.0005, **P < 0.005, *P < 0.05. d, Overview of CNAs in
matched primary tumour and recurrence pairs for the most variable CNAs.
Events (copy-number changes, clustered breakpoints or increases in ploidy)
that were gained upon recurrence have a thicker outline. Percentages denote
the percentage of matched samples acquiring a CNA or genome duplication.
e, Example of a tumour for which polyploidy was validated using FISH (n = 28
tested samples); the chromosome 9 and 11 centromeres were used as probes.
f, Examples showing clustered breakpoints on chromosome 19. Chromosome
19 is shown as a circular representation; translocations to other chromosomes
were annotated as single positions. All SVs were detected using mate pair
sequencing.
Extended Data Fig. 7 | Conservation of events for individual patients.
Summary of events occurring in seven matched primary tumours compared to
recurrences (first, second or third relapse) and two matched relapses. For
every sample conservation of SNVs is given as a graph with the allele
frequencies (AF) of the primary tumour on the x axis and the recurrence on the
y axis. In the last panel, two matched recurrences are shown with a recurrence
[...] comparison has a table showing the total number of events in each quadrant
(lost, primary AF > 10% and recurrence AF < 2%; stable, primary AF > 20% and
recurrence AF > 20%; and gained, primary AF < 2% and recurrence AF > 10%).
Conservation of SVs is given as a circular representation of the genome with the
CNAs from the primary tumour in the outer rim and the recurrence in the inner
rim. SVs were coloured by detection in either only the primary tumour (red),
only in the relapse (grey) or in both (blue). Each combination also has a Venn
[...] tumour, the recurrence or both.
Extended Data Fig. 8 | Mutations in primary tumours and relapses. a, Box
plots showing the total number of SNVs or indels in primary tumours (n = 20)
compared to relapses (n = 12). Boxes show the median, first and third quartile,
and whiskers extend to 1.5× the interquartile range. We detected, on average,
1,180 SNVs (range, 339-2,544) and 468 indels (range, 299-1,026) in primary
tumours and 5,162 SNVs (range, 2,992-7,773) and 847 indels (range, 554-1,187)
in relapsed tumours throughout the genome. In coding regions, there were on
average 14 non-synonymous SNVs (range, 3-45) and 2 indels (range, 0-7) in
primary tumours and 59 non-synonymous SNVs (range, 37-92) and 6 indels
(range, 2-11) in relapsed tumours. b, Percentage of substitutions of either the
combined primary tumours (n = 20) or combined relapses (n = 12) divided by
substitution type and affected strand for SNVs residing in transcribed regions.
Transcriptional asymmetry is defined as the difference between the number of
SNVs on the transcribed strand versus the untranscribed strand for each
substitution type. Data are mean ± s.e.m.; P values were calculated using
two-sided Poisson tests; ***P < 0.0005, **P < 0.005, *P < 0.05. c, Substitution-type
probability based on the 96 different trinucleotide contexts for a matched
primary relapsed pair shown in d compared to a cisplatin signature and new
paediatric cancer signature (P1). d, Cosine similarity between the cisplatin
signature and other signatures (n = 36). P values were calculated using pairwise
Pearson correlation applied to the similarity matrix; ***P < 0.0005, **P < 0.005,
*P < 0.05.
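The cosine similarity used in panel d to compare signatures can be illustrated with a short sketch. This is a generic implementation of the standard formula, not the authors' pipeline (the reporting summary lists the R package MutationalPatterns for this kind of analysis); the function name and the toy 96-channel profiles are assumptions made for illustration only.

```python
import math

def cosine_similarity(sig_a, sig_b):
    """Cosine similarity between two equal-length mutational profiles."""
    if len(sig_a) != len(sig_b):
        raise ValueError("profiles must have the same number of channels")
    dot = sum(a * b for a, b in zip(sig_a, sig_b))
    norm_a = math.sqrt(sum(a * a for a in sig_a))
    norm_b = math.sqrt(sum(b * b for b in sig_b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for an all-zero profile")
    return dot / (norm_a * norm_b)

# Toy 96-channel profiles (made-up numbers, not data from the study).
uniform = [1 / 96] * 96   # every trinucleotide context equally likely
peaked = [0.0] * 96       # all probability mass in a single context
peaked[0] = 1.0

print(cosine_similarity(uniform, uniform))  # identical profiles
print(cosine_similarity(uniform, peaked))   # dissimilar profiles
```

Because mutational profiles are non-negative, the value lies between 0 (no shared contexts) and 1 (identical relative contributions), and it does not depend on the total number of mutations in either profile.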
Extended Data Fig. 9 | ETMRs have dense and strongly conserved C>T and
C>G mutations around breakpoints. a, Rainfall plot showing an example of
kataegis around C19MC. Every point represents a somatic SNV coloured by
substitution type, the x axis represents the position in the genome and the
position on the y axis represents the density of SNVs. b, Lollipop plot showing
SNVs per 1 kb in a region of 10,000 bp surrounding breakpoints for all ETMRs.
Pins represent the percentage of substitution types of all SNVs within 1 kb,
while the height of the lollipops represents the substitutions per kb.
c, Percentages of substitution types in regions 10 kb around breakpoints (left,
n = 543 SNVs) and the rest of the genome (right, n = 84,991 SNVs). P values were
calculated using a one-sided Fisher's exact test; ***P < 0.0005, **P < 0.005,
*P < 0.05. d, Combined mutation density of four primary tumours coloured by
conservation in the matched recurrence (blue is conserved, grey is not
conserved) as shown by a rainfall plot (top), a density distribution (middle) and
the breakpoint density (bottom). e, Allele frequencies of all primary (x axis)
versus relapse (y axis) tumours. Boxes show conservation (lost, primary
AF > 10% and recurrence AF < 2%; conserved, primary AF > 20% and recurrence
AF > 20%; and gained, primary AF < 2% and recurrence AF > 10%) (n = 2,100 SNVs
with allele frequency over 20% in the primary tumour). P value was calculated
using a two-sided χ² test. f, Percentage of substitution types for SNVs in each
quadrant (lost, primary AF > 10% and recurrence AF < 2%; conserved, primary
AF > 20% and recurrence AF > 20%; and gained, primary AF < 2% and recurrence
AF > 10%). g, Ratio of conserved SNVs compared with not conserved SNVs in
regions around breakpoints with increasing sizes. Conservation is defined as
SNVs with an allele frequency over 20% in the primary tumour and an allele
frequency over 20% in the recurrence; SNVs with an allele frequency lower than
20% in the recurrence but higher than 20% in the primary tumour were defined
as not conserved. P value between 10 kb around breakpoints and the rest of the
genome using a two-sided χ² test (n=2,100, P=5.4 x10™).
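The quadrant definitions in panels e and f amount to a simple rule on the pair of allele frequencies. The sketch below applies the thresholds exactly as stated in the legend; the function name, the "unclassified" label for SNVs falling outside all three boxes, and the example values are assumptions for illustration, not part of the published analysis.

```python
def classify_snv(primary_af, recurrence_af):
    """Assign an SNV to a conservation quadrant from its allele frequencies.

    Allele frequencies are fractions between 0 and 1. Thresholds follow the
    legend: lost (primary > 10%, recurrence < 2%), conserved (both > 20%),
    gained (primary < 2%, recurrence > 10%).
    """
    if primary_af > 0.10 and recurrence_af < 0.02:
        return "lost"
    if primary_af > 0.20 and recurrence_af > 0.20:
        return "conserved"
    if primary_af < 0.02 and recurrence_af > 0.10:
        return "gained"
    # Hypothetical label: the legend does not name SNVs outside the boxes.
    return "unclassified"

# Made-up example pairs (primary AF, recurrence AF).
examples = [(0.35, 0.00), (0.40, 0.45), (0.00, 0.30), (0.15, 0.15)]
for primary_af, recurrence_af in examples:
    print(primary_af, recurrence_af, classify_snv(primary_af, recurrence_af))
```

The three boxes are mutually exclusive, so the order of the checks does not change the result.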
Extended Data Fig. 10 | Context of R-loops and DNA damage in ETMRs and
after Dicer1 knockout. a, Genome-wide density of R-loops in ETMRs, R-loops
in Ewing sarcoma (EWS), RLFS and gene density. b, Representation of SVs
genome-wide and their breakpoint context. Outer layers show the density of
DRIP peaks (blue) or RLFS (red). The inner part shows all SVs from ETMRs
sequenced using WGS, depicting SVs that fall in DRIP-seq peaks (blue) or RLFS
(red). c, R-loop signal detected in genomic regions sorted by R-loop signal
(including elements from non-B-DB and repeatmasker). R-loop signal was
determined for 10,000 randomly selected elements for every type of genomic
feature (n = 21). Violin plots depict kernel density estimates and represent the
density distribution. d, Genome-wide association of breakpoints with genomic
regions sorted by R-loop signal shown in c. Genome-wide associations were
calculated as distance to nearest element compared to a set of 10,000
randomly generated breakpoints. Enrichments were calculated for Ewing
sarcoma breakpoints and breakpoints from other entities (reference set).
P values were calculated using a two-sided Mann-Whitney U-test and adjusted
for multiple testing (Benjamini-Hochberg correction). e, Density of distances
between genomic regions and breakpoints detected in ETMR, Ewing sarcoma,
random breakpoints and reference breakpoints. f, Total percentage of
breakpoints within 1 kb of genomic regions. g, Enrichment of SNVs (n = 85,534)
in ETMR R-loops (n = 16,002 regions) and RLFS (n = 85,534 regions) compared
to random regions of the same size. P values were calculated using a two-sided
χ² test; ***P < 0.0005, **P < 0.005, *P < 0.05. h, Genome-wide distribution of
mouse RLFS and breakpoints occurring in Dicer1 knockout cells compared to
wild-type. The outer rim shows the genome-wide density of mouse RLFS, the
inner rim the CNAs that were found between wild-type and knockout cells and
the inner part shows the SVs that were detected between wild-type and
knockout cells. Breakpoints falling within RLFS are highlighted in red. i, Copy-number
profiles of an example of a translocation coupled to duplication in
RLFS that were found in Dicer1 knockout compared to Dicer1 wild-type cells.
Red arrows depict the location of the translocation and duplication.
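Several legends above report P values adjusted with the Benjamini-Hochberg correction. As a reminder of what that adjustment does, here is a minimal sketch of the standard step-up procedure; it is a textbook implementation written for illustration, not the code used in the study, and the function name and example P values are assumptions.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(pvalues)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted

# Made-up raw P values for four hypothetical tests.
raw = [0.01, 0.04, 0.03, 0.005]
print(benjamini_hochberg(raw))
```

Each raw P value is scaled by the number of tests divided by its rank, and the running minimum ensures that a smaller raw P value never receives a larger adjusted value than a bigger one.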
Statistics


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Software used for data collection was provided by the manufacturer: bcl2fastq (v2.2), with QC performed using FastQC (0.11.4) (Illumina), or
GCOS (3000) for microarray expression data (Affymetrix), minfi (v1.14.0) and limma (v3.24.15), as described in the methods section.
Software to collect data using confocal microscopy was available from Zeiss (Zen suite LSM 800).


Data analysis All software used is described in the methods section. Besides custom scripts, public software that was applied includes: biobambam
(version 0.0.148), bwa-mem-0.7.8-r455, Roddy (version 1.1.73), Samtools 0.1.17-r973, Platypus (0.8.1), Delly (version 0.7.5), R (3.4.3),
ACS (2.0), MAS5.0, R2 (Revision 9522207), Graphpad Prism (v6.01), ImageJ (1.52e), Cibersort (v1.06), Picard (v1.125), Freebayes
(v1.1.0), vt (v0.5), Eland (v2), Varscan (version 2.3.9), DEseq2 (1.18.1), NaviGO, DAVID (v6.8), Toppgene (7), deeptools (3.0.2), qmRLFS
finder (v1.5), conumee (v1.3), MutationalPatterns (v1.4.3), Compusyn, Bowtie (v1.00), IGV (v2.3.97), Rtsne (v0.13), SVA (3.32.1), kallisto
(v0.43.0), Sleuth (v0.30.0) and Annovar (2016-02-01).


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 


All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability


All data used in this study will be made available before publication, as stated in the manuscript, within the corresponding public repository. Methylation and
expression array data are available under accession number GSE122038 in the Gene Expression Omnibus (GEO) and sequencing data are available under
EGAS00001003256 at the European Genome-phenome Archive (EGA). Source data are available for Fig. 1a-c, Fig. 2c, Fig. 3b, c, Fig. 4d, g, Fig. 5a, b, d, Ext. Data Fig.
1a, Ext. Data Fig. 2a-g, Ext. Data Fig. 4b, c, h, Ext. Data Fig. 5c, Ext. Data Fig. 6b, c, Ext. Data Fig. 8a-d, Ext. Data Fig. 9b, c, e, f, g and Ext. Data Fig. 10g. Any additional
data/material used in this study will be made available upon reasonable request.
For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Sample size was determined by the amount of available material; no sample size calculation was performed. We argue that the sample
size used here is sufficient, first because ETMR tumor material is scarce, which makes it difficult to expand the cohort, and second
because our sample size is comparable to other large genomic studies.


Data exclusions Data were excluded from analysis if QC measures were not met. These measures were taken to prevent erroneous assumptions about the data
caused by technical artifacts. FastQC (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) was applied, supplemented
with additional criteria after mapping, including testing for equal coverage, the number of unmapped reads, presence of potential sample
swaps based on SNP fingerprinting, poor read quality, uncertainty about the diagnosis of the sample and tumor cell content.


Replication All experiments have been performed in triplicate and only representative data are shown. Dot blot experiments have been performed at least
five times to ensure reproducibility. All attempts at replication were successful.


Randomization No randomization was applied. All samples were included based on diagnosis as is stated in the methods section. 


Blinding Patient identity was blinded through anonymisation; however, patient age, tumor location, histology, patient gender and treatment outcome
were made available.


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems (n/a | Involved in the study): Antibodies; Eukaryotic cell lines; Palaeontology; Animals and other organisms; Human research participants; Clinical data

Methods (n/a | Involved in the study): ChIP-seq; Flow cytometry; MRI-based neuroimaging


Antibodies 


Antibodies used Anti-DNA-RNA hybrid (clone S9.6) antibody (Merck, MABE1095); γH2AX (Abcam, ab11174), dilution 1:200, lot #3011493


Validation Antibodies were validated by the manufacturer as stated on the manufacturer's website. Specificity of S9.6 was validated using
both human and mouse material treated with RNASEH1 as an additional negative control.


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) The BT183 cell line was kindly provided by J. A. Chan. Dicer WT/KO cell lines were purchased from ATCC (CRL-3220,
CRL-3221).
Authentication Cell lines were authenticated using similarity in variable SNPs against the primary tumor in the case of the BT183 cell line and
by using methylation clustering. Cell lines obtained from ATCC have been authenticated using STR profiling.
Mycoplasma contamination All cells were found negative for mycoplasma infection.


Commonly misidentified lines No commonly misidentified lines were used 
(See ICLAC register) 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Patient characteristics, where available, are listed in supplementary table 1. In general, the majority of patients were under the
age of three at diagnosis, there was no gender bias (male:female = 1:1) and the location of the tumors was evenly spread throughout the
brain, as described in Ext. Data Figure 1b. Relapsed patients have been clearly annotated and were analyzed separately.


Recruitment Samples were collected from biopsies taken between 2000 and 2018 from ETMR patients who were treated
at the contributing centers. All patients included in the study were enrolled under informed consent according to the ICGC
(www.icgc.org) guidelines or INFORM (www.dkfz.de/en/inform/) guidelines. No bias caused by recruitment is expected; however,
possible under-diagnosis of patients lacking C19MC amplification (due to difficulty of diagnosis) cannot be excluded, which could
have led to a lower proportion of patients lacking C19MC amplification.
Ethics oversight Ethics oversight has been conducted by the ICGC and INFORM consortia.


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
Limited knowledge of the mechanisms that govern the self-renewal of human
haematopoietic stem cells (HSCs), and why this fails in culture, has impeded the
expansion of HSCs for transplantation. Here we identify MLLT3 (also known as AF9)
as a crucial regulator of HSCs that is highly enriched in human fetal, neonatal and
adult HSCs, but downregulated in culture. Depletion of MLLT3 prevented the
maintenance of transplantable human haematopoietic stem or progenitor cells
(HSPCs) in culture, whereas stabilizing MLLT3 expression in culture enabled more
than 12-fold expansion of transplantable HSCs that provided balanced multilineage
reconstitution in primary and secondary mouse recipients. Similar to endogenous
MLLT3, overexpressed MLLT3 localized to active promoters in HSPCs, sustained levels
of H3K79me2 and protected the HSC transcriptional program in culture. MLLT3 thus
acts as an HSC maintenance factor that links histone reader and modifying activities to
modulate HSC gene expression, and may provide a promising approach to expand
HSCs for transplantation.


HSCs can self-renew throughout their lifetime while replenishing all
blood lineages, making HSC transplantation a life-saving treatment
for many blood diseases. However, a lack of HLA-matched bone marrow
donors and a low yield of HSCs in cord blood limit the number
of patients that can be treated. A better understanding of HSC
self-renewal is required to expand human HSCs in culture or to generate
them from pluripotent stem cells.

HSCs develop during embryogenesis from haemogenic endothelium
in large arteries and expand in the fetal liver before colonizing the
bone marrow. Although many factors that drive the specification of
haemogenic endothelium and HSCs have been identified, we know less
about those that maintain HSC self-renewal. Here we identify MLLT3 as
a crucial regulator of human HSC maintenance, and show that restoring
MLLT3 levels in cultured human HSCs protects stemness and enables
the ex vivo expansion of transplantable HSCs.


MLLT3 is enriched and required in human HSCs 


To define the molecular machinery that governs human HSC self-renewal
and determine why it fails in culture, we compared the transcriptomes
of highly self-renewing HSPCs from human fetal liver to
their immediate progeny and to dysfunctional, cultured HSPCs,
derived from fetal liver or embryonic stem cells. From the 12 nuclear
regulators correlating with self-renewal, MLLT3 was selected for further
study (Fig. 1a, Extended Data Fig. 1a, b). MLLT3 is a component
of the superelongation complex and co-operates with DOT1L, which
di/trimethylates H3K79 to promote transcription. MLLT3 localizes
to active transcription start sites (TSSs) through the YEATS domain,
which recognizes active histone marks such as H3K9 acetylation and
crotonylation. A truncated MLLT3 that lacks the YEATS domain
forms a leukaemic fusion protein with the N terminus of MLL1, which
misdirects MLLT3-interacting complexes to induce aberrant gene
transcription. MLLT3 also regulates erythroid or megakaryocytic
progenitors and was identified as a definitive HSC hub gene during
mouse development.

MLLT3 expression was enriched in undifferentiated human HSPCs
in fetal liver, cord blood and bone marrow (Extended Data Fig. 1c).
RNA-sequencing (RNA-seq) analysis of developmental haematopoietic
tissues showed that MLLT3 was upregulated in fetal liver (Extended
Data Fig. 1d), whereas genes specific to the development of haemogenic
endothelium and HSCs such as TAL1 (also known as SCL), RUNX1,
SOX17 and HOXA genes were already highly expressed in 5-week
aorta-gonad-mesonephros, which suggests that MLLT3 is involved in HSC
maturation and maintenance.

To determine whether human HSCs require MLLT3, two validated
MLLT3 short hairpin RNAs (shRNAs) (Extended Data Fig. 1e, f) were tested
in an HSPC expansion culture system using the OP9M2 stromal stem-cell
line. Both of the shRNAs resulted in premature depletion of fetal liver
HSPCs (FL-HSPCs) in vitro (Fig. 1b, c, Extended Data Fig. 1g-j). When
FL-HSPCs transduced with MLLT3-knockdown (KD) or control vector were
transplanted into immunodeficient NSG (NOD-SCID Il2rg-null) mice, only
the control cells showed multilineage (myelo/lymphoid) human haematopoietic
reconstitution (Fig. 1d, e, Extended Data Fig. 1k, Supplementary
Table 1), which indicates an important regulatory function for MLLT3.
Fig. 1 | MLLT3 regulates human HSPC expansion. a, Venn diagram of
microarray gene expression data, identifying genes enriched in self-renewing
human FL-HSPCs. Number of genes downregulated after differentiation (pink)
of fetal liver CD34+CD38−/loCD90+GPI80+ HSCs to CD34+CD38−/loCD90+GPI80−
progenitors; number of genes downregulated in FL-HSPCs during 5-week
culture on OP9M2 stroma (green); and number of genes suppressed in human
embryonic stem (ES)-cell-derived HSPCs (purple) are shown. b, FACS analysis
30 days after transduction of CD34+CD38−/loCD90+ HSPCs with MLLT3 shRNA
(MLLT3-KD) or empty vector control (CTR) (representative of three plots).
c, Quantification of cells as in b after 5, 15 and 30 days in culture (n = 3). d, FACS
analysis of bone marrow from NSG mice 12 weeks after transplantation of
FL-HSPCs transduced with MLLT3-KD or empty vector control (representative of
10 mice). e, Quantification of human (h) CD45+ cells in bone marrow (BM) from


Sustaining MLLT3 levels improves HSPC culture 


As studies had shown expansion of multipotent human HSPCs on
OP9M2 stroma without measurable expansion of transplantable
HSCs, we asked whether maintaining MLLT3 expression in cultured
HSPCs improves their function. Restoring physiological MLLT3 levels
using an overexpression lentiviral vector (MLLT3-OE) significantly
enhanced the expansion of CD34+CD38−/loCD90+ FL-HSPCs on OP9M2
stroma (78-fold greater than controls at 6 weeks) (Fig. 1f, g, Extended
Data Fig. 2a, b). Targeting MLLT3-OE in the most undifferentiated
CD34+CD38−/loCD90+GPI80+ FL-HSPCs recapitulated the expansion
phenotype, whereas MLLT3-OE in CD34+CD38−/loCD90+GPI80− progenitors
did not confer the HSC immunophenotype or expand them
(Extended Data Fig. 2c-e). Withdrawing OP9M2 stroma depleted undifferentiated
MLLT3-OE HSPCs, demonstrating a dependence on an HSC-supportive
microenvironment (Extended Data Fig. 2f-h). MLLT3-OE
also enhanced FL-HSPC expansion when cultured in clinically suitable
conditions using serum-free expansion medium (SFEM) and the small
molecules SR1 and UM171, or on OP9M2 stroma with both SR1 and
UM171 (Extended Data Fig. 2i, j), showing the beneficial effects on HSPC
expansion in all HSC-supportive conditions tested.

BrdU incorporation assays did not show enhanced proliferation 
of MLLT3-OE HSPCs in culture (Extended Data Fig. 3a, b). Staining of 
annexin V and 7-aminoactinomycin D (7AAD) showed greater viability
of MLLT3-OE FL-HSPCs than empty-vector-transduced FL-HSPCs, but 
reduced cell viability compared with uncultured FL-HSPCs (Extended 
Data Fig. 3c). Differentiation of MLLT3-OE HSPCs after a 4-week expan- 
sion showed comparable monocytic, granulocytic, erythroid, mega- 
karyocytic, T and B lymphoid differentiation potential to non-expanded 
NSG mice treated as in d (n = 10 mice, two independent experiments). f, FACS
analysis of CD34+CD38−/loCD90+ FL-HSPCs transduced with control or
MLLT3-OE lentiviral vector (representative of six experiments). g, Expansion of HSPCs
as in f (n = 6 independent experiments). h-j, FACS analysis showing human
haematopoietic reconstitution (hCD45 expression) (h) and quantification of
total hCD45 cells (i) or human HSPCs (hCD45+CD34+CD38−) (j) in bone
marrow from NSG mice. In i and j, the number of mice showing multi-lineage
reconstitution versus the number of total transplanted mice is shown (n = 13 or
14 mice, 4 independent experiments). APCcy7, PEcy7, BV421 and BV711 denote
fluorochrome dyes. Data in c denote mean values; data in e, i and j denote mean
and individual values; data in g are mean ± s.e.m. All P values determined by
two-sided t-test.


FL-HSPCs (Extended Data Fig. 3d, e). Hence, sustaining MLLT3 expres- 
sion in cultured HSPCs maintains their identity and viability without 
causing excessive proliferation or resistance to apoptosis, or blocking 
differentiation. 


MLLT3 enhances transplantation of cultured HSPCs 


To assess whether maintaining MLLT3 levels in cultured HSPCs improves
the reconstitution potential in NSG mice, CD34+GFP+ fetal liver cells
sorted at day 5 and transduced with MLLT3-OE or control vector were
expanded in SFEM containing SR1 and UM171 and transplanted on day 15
(Extended Data Fig. 4a). Mice transplanted with MLLT3-OE cells showed
more frequent long-term (24 week) human multilineage engraftment
in the bone marrow, and higher levels of engraftment than mice transplanted
with control HSPCs (Fig. 1h, i, Extended Data Fig. 4b, Supplementary
Table 2a). Most mice transplanted with MLLT3-OE cells, but
not with control cells, also contained human HSPCs in the bone marrow
and generated myeloid and lymphoid cells in the peripheral blood and
spleen (Fig. 1j, Extended Data Fig. 4c-e). Increased haematopoietic
reconstitution with MLLT3-OE HSPCs was not explained by altered
proliferation of MLLT3-OE HSPCs or differentiated cells in recipient
mice or in culture before transplantation (Extended Data Fig. 4f-j).

To ascertain the role of extended culture in MLLT3-mediated
enhanced in vivo reconstitution, the engraftment levels were compared
between CD34+GFP+ cells sorted 5 days after transduction and
their progeny sorted at 15 days. Culture with MLLT3-OE, but not control
vector, significantly increased total human haematopoietic reconstitution
and HSPC reconstitution (Extended Data Fig. 4k-m). When bone
marrow from primary mice was transplanted into secondary mouse
Fig. 2 | MLLT3 binds to the TSSs of active genes in a cell-type-specific manner.
a, Distribution of MLLT3 peaks in FL-HSPCs. b, Genes expressed in FL-HSPCs
(RPKM > 1, n = 3) were divided into MLLT3-bound genes (780, peak < 5 kb from
the TSS) and MLLT3-unbound genes (1,312, no detectable peak), and compared
to 500 randomly selected non-expressed genes (RPKM < 0.1). Red lines denote
median values. P value determined by two-sided t-test. c, Average profile and
heat map for MLLT3, H3K9ac, RNA Pol II and H3K79me2 ChIP-seq in FL-HSPCs.
Metagene plot ± 2 kb is shown. d, GO analysis of MLLT3-bound genes in
FL-HSPCs with gene numbers for each category. e, Heat maps for selected
MLLT3-bound gene groups. IER, immediate early response. f, Venn diagram comparing
MLLT3 peaks in FL-HSPCs and fetal liver erythroblasts (FL-Ery). g, UCSC
genome browser tracks showing ChIP-seq of MLLT3 and epigenetic marks in
representative MLLT3-bound genes in FL-HSPCs and FL-erythroblasts. In a-g,
n = 3 MLLT3 in FL-HSPCs, n = 2 others.


recipients, only MLLT3-OE cells showed multilineage reconstitution 
after 18 weeks (Extended Data Fig. 4n, o). These data indicate that
maintaining MLLT3 expression in FL-HSPCs during culture enhances 
human multilineage haematopoietic reconstitution in primary and 
secondary recipients. 


MLLT3 binds to TSSs of active genes in human HSPCs 


To understand how MLLT3 regulates human HSC stemness, we
assessed the MLLT3 chromatin-binding pattern in FL-HSPCs. Chromatin
immunoprecipitation followed by high-throughput sequencing
(ChIP-seq) showed MLLT3 binding at 1,579 sites, with strongest
enrichment around TSSs and within 5 kb downstream (Fig. 2a). MLLT3
peaks associated with 889 genes, 96.4% of which were expressed in
FL-HSPCs, with reads per kilobase of transcript per million mapped
reads (RPKM) values greater than one (Supplementary Table 3). The
k-means clustering analysis showed co-localization of MLLT3 peaks
with marks of active TSSs (assay for transposase-accessible chromatin
using sequencing (ATAC-seq) peaks, H3K4me3, H3K9ac, H3K9cr,
H3K27ac and RNA polymerase (Pol) II) (Extended Data Fig. 5a). The
gene body histone mark H3K79me2 partially overlapped with MLLT3,
whereas H3K36me3 was not enriched. There was minimal overlap with
the enhancer mark H3K4me1 or the repressive marks H3K27me3 and
H3K9me3. Thus, MLLT3 predominantly localizes to active promoters
in HSPCs.
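The expression cutoff used above (RPKM greater than one) rests on the standard RPKM normalization, which scales a gene's read count by transcript length in kilobases and by sequencing depth in millions of mapped reads. A minimal sketch follows; the function name and the example numbers are illustrative assumptions, and the study's own quantification pipeline is described in its Methods rather than here.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and total mapped reads must be positive")
    # 1e9 combines the per-kilobase (1e3) and per-million (1e6) scaling.
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)

# Made-up example: 1,000 reads on a 2 kb transcript in a 10-million-read library.
value = rpkm(1_000, 2_000, 10_000_000)
print(value)
print(value > 1)  # would pass an "expressed" cutoff of RPKM > 1
```

Dividing by transcript length keeps long genes from appearing more highly expressed simply because they collect more reads, and dividing by library size makes values comparable between samples sequenced to different depths.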

Comparing MLLT3-bound genes to other expressed genes showed
that, although both contained active epigenetic marks at TSSs,
MLLT3-bound genes featured higher median expression, higher H3K79me2
enrichment and higher RNA Pol II occupancy (Fig. 2b, c, Extended
Data Fig. 5b). Gene Ontology (GO) analysis of MLLT3-bound genes
in FL-HSPCs revealed enrichment of biological processes involved in
regulation of gene expression and nucleosome assembly (for example,
histone genes), immune system development and haemopoiesis (for
example, HSC transcription factors RUNX1, MYB, MECOM and HOXA9)
and translation (for example, ribosomal proteins) (Fig. 2d, Supplementary
Table 3a-c). Analysis of epigenetic marks in distinct MLLT3-bound
gene groups revealed differential enrichment for H3K79me2 and Pol
II: the histone genes and immediate early response genes (such as JUN
and FOS) showed high Pol II occupancy but low H3K79me2, whereas
HSC genes and ribosomal protein genes showed high enrichment for
H3K79me2 (Fig. 2e, Extended Data Fig. 5c). These data suggest that
MLLT3 may regulate distinct target genes in HSPCs by influencing
H3K79me2 and/or Pol II activity.

Analysis of MLLT3 binding in erythroblasts from fetal liver (Fig. 2f, 
Supplementary Table 3d) revealed cell-type specificity: 200 MLLT3 
peaks were identified around TSSs, partially overlapping with FL-HSPC 
peaks (Extended Data Fig. 6a-c). Common GO categories included
nucleosome assembly, whereas erythroid-specific categories included 
oxygen transport and haem metabolic processes (Extended Data 
Fig. 6d). Genome browser tracks demonstrated the correlation of 
MLLT3 binding with epigenetic marks of active TSS in each cell type 
(Fig. 2g, Extended Data Fig. 6a), and enrichment of H3K79me2 in MLLT3- 
bound HSC transcription factor genes. 


MLLT3 protects HSC gene expression in cultured HSPCs 


We next asked how sustaining MLLT3 expression in cultured FL-HSPCs 
modulates their transcriptional program. ChIP-seq data showed similar 
distribution of MLLT3-bound peaks and genes in uncultured HSPCs and 
MLLT3-OE-vector transduced HSPCs after a 4-week culture (Extended
Data Fig. 7a-c). RNA-seq showed that modest differences in MLLT3
expression (2.77-fold MLLT3-OE versus control vector in HSPCs after 
4-week culture) resulted in significant differential expression of 541 
upregulated and 717 downregulated genes (Fig. 3a, Supplementary 
Table 4). Genes that regulate translation and glycolysis, and several HSC 
transcription factors (MECOM (also known as EVI1), HLF, MYB and GFI1)
and HSC surface proteins (c-KIT (also known as KIT), CXCR4, ROBO4,
EMCN and PROM1 (also known as CD133)) were significantly upregulated
in MLLT3-OE HSPCs (Fig. 3b). Programs related to immune response 
and apoptosis were suppressed (Fig. 3b). Although MLLT3 binding was 
equally distributed between upregulated and downregulated genes, 
MLLT3 binding was enriched in specific gene categories such as MLLT3- 
OE-upregulated HSC transcription factor genes, and downregulated 
nucleosome assembly (for example, histone) genes. Immune response 
and apoptosis genes suppressed in MLLT3-OE HSPCs showed minimal 
binding (Fig. 3b-d). Comparison of cultured and uncultured HSPCs 
suggested that MLLT3-OE may help to diminish culture-associated 
drift in gene expression, either directly (HSC factors) or indirectly 
(immune response genes) (Fig. 3d, e, Extended Data Fig. 7d). Functional
assessment of 2 of the 12 candidate HSC factors identified in Extended
Data Fig. 1a (MECOM and HLF, both bound and upregulated by MLLT3)
validated them as important MLLT3 downstream effectors that sustain
HSC stemness. Knockdown of either factor resulted in premature HSPC
exhaustion and diminished the effects of MLLT3-OE on HSPC expansion
(Extended Data Fig. 7e-h).

Given the strong association between MLLT3 binding and H3K79me2 
deposition in genes that encode HSC regulators, we asked whether 
MLLT3-OE protects the expression of HSC genes in cultured HSPCs 
Fig. 3 | MLLT3 protects HSC stemness program through DOT1L and
H3K79me2. a, Venn diagram showing the overlap between MLLT3-bound (n = 6
MLLT3 ChIPs combined; Extended Data Fig. 7b) and MLLT3-OE up- or
downregulated genes. Total number and percentage of genes are shown. b, GO
analysis of genes up- or downregulated in MLLT3-OE FL-HSPCs at 4 weeks.
Numbers and percentage of MLLT3-bound and total genes are shown. c,
Examples of gene groups up- or downregulated by MLLT3-OE with MLLT3-
bound genes marked in green. Numbers and percentage of MLLT3-bound and
total genes are shown. d, Heat map showing differentially expressed genes in
HSPCs transduced with MLLT3-OE or control vector after 4-week expansion
(n = 6 independent experiments). P < 0.05, Benjamini-Hochberg adjusted
t-test. e, Differential expression (DE) of MLLT3-regulated genes between
uncultured HSPCs (n = 3) and 4-week expanded FL-HSPCs (n = 6). Selected
MLLT3-regulated genes that are similarly regulated in uncultured HSPCs are
highlighted. MLLT3-bound genes are in bold. f, Quantification of RNA-seq and
ChIP-seq signals in MLLT3-bound, upregulated HSC transcription factor
genes, and unbound, downregulated immune response genes. RNA-seq
represents fold change between expanded HSPCs transduced with MLLT3-OE
or control vector. ChIP-seq shows ratio of MLLT3-OE and control signal
normalized to non-MLLT3-bound housekeeping gene (VCL) (n = 6 RNA-seq; n = 5
H3K79me2 ChIP, n = 2 other ChIPs). Data are mean ± s.e.m. P values determined
by two-sided t-test. g, Ratio of H3K79me2 signal in MLLT3-bound upregulated
HSC genes between MLLT3-KD and control vector after 5-day culture (data are
mean and individual values, n = 10 genes, two experiments) (left), and between
MLLT3-OE, MLLT3-OE plus EPZ5676 (DOT1L inhibitor), or CTR plus EPZ5676
and control vector after 10-day culture (middle). H3K79me2 signal is
normalized to S2 cell chromatin spike-in. MLLT3-OE versus control from
4-week culture is also shown (right). P values in g determined by one-sample
two-sided t-test. NS, not significant.
through DOT1L and H3K79me2. A significant MLLT3-dependent
increase in H3K79me2 was observed in MLLT3-bound haematopoietic
regulators but not in immune response genes that were indirectly
downregulated by MLLT3-OE (Fig. 3f, Extended Data Fig. 8a). Other 
active marks tested (H3K9ac, H3K4me3 and H3K36me3) showed little 
change with MLLT3-OE. Dependence of H3K79me2 on MLLT3 levels was 
verified in MLLT3-KD HSPCs (Fig. 3g). Culture with the DOT1L inhibitor
EPZ5676 reduced the MLLT3-OE-associated increase in H3K79me2 in HSC
genes (Fig. 3g). EPZ5676 decreased H3K79me2 in both MLLT3-bound
and non-bound genes, whereas MLLT3-KD and MLLT3-OE only affected
H3K79me2 in MLLT3-bound genes (Extended Data Fig. 8b). These data
suggest that, although MLLT3 is not required for DOT1L activity and
H3K79me2 deposition per se, MLLT3 cooperates with DOT1L in human
HSCs to enhance H3K79me2 deposition in HSC regulatory genes and 
to maintain their activity during culture expansion. 


MLLT3 enables ex vivo expansion of cord blood HSCs 


The low number of HSCs in cord blood limits their use for transplanta- 
tion, despite their better availability and more permissive HLA-match- 
ing than bone marrow. We therefore asked whether MLLT3 can be used 
to control the self-renewal of cord-blood HSCs (CB-HSCs). MLLT3-KD 
in cord-blood HSPCs (CB-HSPCs) severely impaired HSPC maintenance
in co-culture experiments with OP9M2 stroma cells (Extended Data
Fig. 9a, b). Conversely, MLLT3-OE improved CB-HSPC expansion both in
co-culture with OP9M2 cells and in serum-free conditions with UM171
and SR1 (Fig. 4a, b, Extended Data Fig. 9c, d). 

A limiting-dilution transplantation assay (Fig. 4a) was used to quantify
the expansion of transplantable HSCs (repopulating units) during
CB-HSC culture. Several doses of MLLT3-OE-transduced and control- 
transduced cells were transplanted either at day 5 after transduction 
or day 15 after culture and compared to uncultured CB-HSPCs. When 
high doses were transplanted, multilineage human haematopoietic 
reconstitution was observed in mice transplanted with uncultured 
cells or with HSPCs transduced with control vector or MLLT3-OE vector 
(Fig. 4c). Quantification of lineage differentiation in engrafted mice 
confirmed a comparable differentiation ability of MLLT3-OE and uncultured
HSPCs (Fig. 4d), whereas HSPCs expanded by control vector
showed myeloid bias. Moreover, mice engrafted with MLLT3-OE cells 
showed both increased HSPCs and total reconstitution compared with 
equal cell doses of control cells (Fig. 4e, f). Analysis of mice transplanted 
at limiting dilutions verified robust expansion of MLLT3-OE HSCs at 
15 days of culture; 12.5-fold increase in repopulating units compared 
with uncultured HSPCs and a6.8-fold increase compared with MLLT3- 
OE HSPCs transplanted at day 5 (Fig. 4f-h). Therefore, increased recon- 
stitution ability was not caused merely by the expansion of MLLT3-OE 
cells in recipient mice. Although cells transduced with control vector 
showed some expansion (2.4-fold compared with uncultured HSPCs) 
at 15 days, the expansion of repopulating units with MLLT3-OE was 
5.2-fold higher. These data corroborate the importance of maintaining 
MLLT3 levels during culture, both to achieve greater HSC expansion 
and preserve differentiation potential. 
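
The fold changes quoted above are ratios of repopulating-unit estimates, so they can be cross-checked by simple division. The following is a minimal sketch that uses only the expansion factors reported in the text; the variable names are illustrative, and the implied day-5 value is an inference from those factors, not a reported result.

```python
# Expansion factors reported in the text (relative to uncultured CB-HSPCs
# unless noted otherwise).
mllt3_oe_day15 = 12.5        # MLLT3-OE, day 15 versus uncultured
control_day15 = 2.4          # control vector, day 15 versus uncultured
mllt3_day15_vs_day5 = 6.8    # MLLT3-OE, day 15 versus MLLT3-OE day 5

# MLLT3-OE relative to control vector at day 15.
oe_vs_control = mllt3_oe_day15 / control_day15

# Implied MLLT3-OE day-5 value relative to uncultured cells (inferred, not reported).
implied_day5 = mllt3_oe_day15 / mllt3_day15_vs_day5

print(f"MLLT3-OE versus control at day 15: {oe_vs_control:.1f}-fold")
print(f"Implied MLLT3-OE day 5 versus uncultured: {implied_day5:.1f}-fold")
```

The first ratio reproduces the 5.2-fold difference stated in the text.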

Blood and spleen showed solid human haematopoietic recon- 
stitution by MLLT3-OE-expanded cells without evidence of lineage 
bias, whereas control-vector-expanded cells showed inconsistent, 
lower-level engraftment (Extended Data Fig. 10a-d). Secondary transplantation
of bone marrow from primary mice resulted in human haematopoietic
reconstitution with uncultured and MLLT3-OE cells, but
not with control-vector-expanded cells (Extended Data Fig. 10e, f). 
Neither primary nor secondary mice transplanted with MLLT3-OE cells 
suffered increased mortality or expansion of immature populations 
(Extended Data Fig. 10g, h), unlike mice transplanted with haematopoietic
cells expressing the oncogenic fusion protein MLL1-MLLT3 that
rapidly developed leukaemia. Thus, sustaining MLLT3 expression
in CB-HSPCs during culture enables a more than 12-fold expansion of 
Fig. 4 | MLLT3-OE expands transplantable CB-HSCs in culture. a, Limiting
dilution analysis for assessing MLLT3-OE effects on CB-HSPC expansion and
in vivo reconstitution. b, Quantification of total cell and HSPC expansion in
culture (n = 3). Data are mean values. P values determined by two-sided t-test.
c, Representative FACS plots showing human haematopoietic reconstitution in
mice transplanted with uncultured CB-HSPCs or CB-HSPCs transduced with
control or MLLT3-OE vectors and transplanted at day 15 after culture and
analysed 24 weeks after transplantation. FACS plots show total human
haematopoietic reconstitution (CD45), multilineage differentiation (myeloid,
B-lymphoid and T-lymphoid cells) and HSPCs in hCD45+ cells. d, Quantification
of differentiated populations as a percentage of hCD45+ cells in the bone
marrow of engrafted mice. e, Quantification of HSPCs as a percentage of live
bone marrow cells in engrafted mice. f, Quantification of human


transplantable HSCs that maintain balanced multilineage haemat- 
opoiesis, demonstrating self-renewal of human HSCs in culture. 


Discussion 


Our discovery that restoring MLLT3 levels improves the expansion of 
engraftable, multipotent human HSCs when cultured with SR1 and 
UM171 (small molecules that show promise for clinical HSC expansion)
suggests that combining several strategies to support HSCs
may be optimal for clinical translation. Notably, recent studies showed
a marked increase in mouse HSC activity after replacing albumin in
culture medium. As harnessing HSC self-renewal in culture becomes
a reality, the potential for transformation and clonal haematopoiesis
needs to be monitored carefully, independent of the expansion proto- 
col used. Because every cell division in itself poses a risk for mutations, 
the goal should not necessarily be maximal HSC expansion, but safe 
expansion that preserves HSC integrity. Although no adverse effects 
were observed with the MLLT3-OE vector, the use of transient, non- 
integrating methods to maintain MLLT3 levels would be more appro- 
priate for clinical use. 

Our work identified MLLT3 as an ‘HSC maintenance factor’ that pre- 
serves—rather than confers—HSC stemness, by reinforcing programs 
established by other factors. Localization of MLLT3 to active TSSs of 
HSC genes in human HSCs in a cell-type-specific manner, using its 
haematopoietic reconstitution in NSG mice at 24 weeks. Mice were
transplanted in limited dilution doses with uncultured cells, or with transduced
cells cultured for 5 or 15 days. The percentage of hCD45+ cells in bone marrow
and the number of multilineage engrafted versus total transplanted mice are
indicated. Data in d-f are mean and individual values from three independent
experiments. P values determined by two-sided t-test. In f, n denotes engrafted
mice/total transplanted mice for each condition. g, h, Calculation of
reconstituting units from limiting dilution assay. Data in h are expressed as
reconstituting units (RU) per 1,000 uncultured CB-HSPCs transplanted
directly or transduced with MLLT3-OE or control vector and cultured for 5 or
15 days before transplantation. Expansion factors and P values between
reconstituting units in uncultured HSPCs and MLLT3-OE or control vector-
expanded cells were calculated using ELDA software and are shown.


YEATS domain, provides a molecular basis for the accurate binding
of MLLT3-OE in cultured HSPCs. This explains why full-length MLLT3
does not induce a differentiation block or ectopic activation of
self-renewal, unlike the oncogenic MLL1-MLLT3 fusion protein, which
rapidly converts haematopoietic progenitors to self-renewing leukaemia
stem cells. Our work identifies MLLT3 as a central regulator
of transcription factors that individually control HSC function
and suggests that MLLT3-DOT1L-dependent regulation of H3K79me2
helps to maintain an active chromatin state in HSC regulatory genes
during culture. With the ability to protect the HSC stemness program as
HSCs divide, without enhancing proliferation or imposing a self-renewal
program on progenitors, MLLT3 may help to improve HSC ex vivo
expansion for clinical use. 
Methods


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized, and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Human haematopoietic tissue collection and processing 

Human HSPCs from fetal liver and cord blood were used to assess the 
function of MLLT3 in fetal and neonatal human HSC. Second trimes- 
ter (14-18 weeks) fetal livers were de-identified, discarded material 
obtained from elective terminations of pregnancy after informed 
consent. Specimen age is denoted as developmental age, two weeks 
less than gestational age, and was determined by ultrasound or esti- 
mated by the date of the last menstrual period. Cord blood units were 
obtained from full-term pregnancies following informed consent and 
de-identified upon collection. Because these tissues are discarded 
material with no personal identifiers, this research does not constitute 
human subjects research. 

Fetal liver samples were mechanically dissociated using scalpels and 
syringes and strained through a 70-µm mesh. Single-cell suspensions
from fetal liver and cord blood were enriched in mononuclear cells by
layering on Lymphoprep (Stem Cells Technologies) and centrifugation
following manufacturer instructions. CD34+ cells were magnetically
isolated from the mononuclear cell fraction using anti-CD34 microbeads
(Miltenyi Biotech). 


OP9M2 stroma co-culture for HSPC expansion

Human HSC co-culture on OP9M2 (subclone of OP9, derived in our
laboratory and validated by gene expression analysis) stroma was
used to test the maintenance and expansion of immunophenotypic
HSPCs. OP9M2 cells were irradiated (20 Gy) and pre-plated (50,000
cells cm⁻²) 24 h before the start of co-culture in OP9 medium, which
includes α-MEM (Invitrogen), 20% fetal bovine serum (FBS; Omega)
and 1x penicillin/streptomycin/glutamine (P/S/G). Human HSPCs were
plated on the stromal layer in OP9 medium supplemented with human HSC
cytokines SCF (25 ng ml⁻¹, Peprotech or Invitrogen), FLT3-L (25 ng ml⁻¹,
Peprotech) and TPO (25 ng ml⁻¹, Peprotech) (HSC medium). Cells were
co-cultured at 37 °C and 5% CO₂ and re-plated or analysed/sorted by
flow cytometry for HSPC markers (CD34+CD38−/loCD90+GPI80+) every
7-14 days. Half of the HSC medium was replaced every 2-3 days. Where
indicated, 500 nM StemRegenin 1 (SR1) and 35 nM UM171 (SCT) were
added to the cultures to improve human HSPC expansion.


SFEM culture for HSPC expansion 

To test HSPC function in clinically relevant serum-free, stroma-free 
culture conditions, HSPCs were plated in StemSpan SFEM II (SCT) 
supplemented with human SCF (100 ng ml⁻¹), human FLT3-L (100 ng
ml⁻¹), human TPO (50 ng ml⁻¹), human low-density lipoprotein (10 µg
ml⁻¹, SCT), P/S/G, 500 nM SR1 and 35 nM UM171. Cells were cultured at
37 °C and 5% CO₂ and re-plated or analysed/sorted by flow cytometry
for human HSPC markers every 7-14 days. Half of the HSC medium
was replaced every other day. In some experiments, the DOT1L inhibitor
EPZ5676 (Cayman) was added at a concentration of 500 nM. 


Flow cytometry and cell sorting 

FACS analysis was performed using single-cell suspensions prepared 
from fetal liver and cord blood as described above, or obtained from 
NSG mice transplanted with human cells. 

For identification of human HSPCs, cells were stained with mouse 
anti-human monoclonal antibodies against human CD34-APC cl. 581 
(555824; BD, 1:20) or -BV605 (343529; Biolegend, 1:20), CD90-FITC cl. 
5E10 (555595; BD, 1:100) or -APC (555595; BD, 1:100), CD38-PE-Cy7 or 
-BUV496 cl. HIT2 (560677 and 564657; BD, 1:100), GPI-80-PE cl. 3H9
(DO87-5; MBL, used at 1:50). Cells were assayed on a BD-LSRII flow
cytometer and data were analysed with FlowJo software (Tree Star).
Cell sorting was performed using a BD FACS Aria II.

To monitor human haematopoietic engraftment in vivo in NSG mice, 
bone marrow cells were stained with rat anti-mouse-CD45-APC-H7 
cl.30-F11 (557659; BD, 1:100) and mouse anti-human monoclonal anti- 
bodies against human-CD45-BV711 or -BV785 cl.HI30 (304050, 304048; 
Biolegend, 1:100). Haematopoietic differentiation was assessed in vitro 
and in vivo using mouse anti-human monoclonal antibodies against 
human CD19-PE or BV605 cl. 1D3 or HIB19 (12-0193, 12-0199; eBio- 
sciences, 1:50), CD3-PE-Cy7 cl. SK7 (557851, BD) (eBiosciences, 1:50), 
CD4-APC (MHCD04035; Invitrogen, 1:50), CD8-PE cl. HIT8A (555635; BD, 
1:50), CD13-APC cl. WM15 (557454; BD, 1:50), CD66b-BV421 cl. G10F5
(555724; BD, 1:50), CD14-V500 cl. WM53 (561816; BD, 1:100), CD235a-PE
or -APC (Glycophorin A, HIR2, BD 1:100), CD71-AF647 (DF1513, Santa
Cruz, 1:100), CD41a-APC cl. HIP8 (579777; BD, 1:20), CD42b-PEcy7 cl. HIP1
(303916; Biolegend, 1:100), CD10-AF700 cl. HI10a (563509; BD, 1:100),
CD33-PE cl. WM53 (555450; BD, 1:50), CD24-BV711 cl. ML5 (563401; BD,
1:100), CD20-BV650 cl. 2H7 (563780; BD, 1:100). 

Dead cells were excluded with 7AAD (BD Biosciences, used at 1:50). 
For assessment of cell-cycle stages or apoptosis, 7AAD was combined 
with BrdU-PE (556029; BD, 1:100) or annexinV-PE (556422; BD, 1:50), 
respectively (see below). 


RNA isolation, cDNA synthesis and qRT-PCR 

RNA isolation was performed using the RNeasy Mini kit (Qiagen) with
an additional DNase step, following the manufacturer’s protocol. cDNAs
were prepared using the High-Capacity cDNA Reverse Transcription Kit
(ThermoFisher), and qPCR for GAPDH, MLLT3, MECOM and HLF was performed
with the LightCycler 480 SYBR Green I Master Mix (Roche) or TaqMan
Gene Expression Master Mix on the LightCycler 480 (Roche). Primers
are presented in Supplementary Table 5. 


Production of lentiviral shRNA and overexpression vectors 
shRNA experiments were performed with pLKO lentiviral vec- 
tors from the TRC library containing puromycin resistance gene. 
shRNA TRCN0000005793 (93, Sigma) was selected for in vitro 
and in vivo experiments after testing for knockdown efficacy compared
to other shRNAs against MLLT3 from the RNAi Consortium
(TRCN0000005790 to TRCN0000005794) series. TRCN0000014790
and TRCN0000002528 were used to knock down HLF and MECOM,
respectively. shRNAs were tested on the K562 and HepG2 cell lines,
obtained from ATCC and mycoplasma-free.

Human MLLT3 was cloned from human FL-HSPC full-length cDNA into 
the constitutive FUGW (Addgene plasmid 14883, from D. Baltimore) 
lentiviral vector, in which its expression is controlled by the UBC promoter.
MLLT3 cDNA with a C-terminal V5-tag was inserted downstream
and in frame with the GFP sequence with the synthetic addition of a P2A
sequence between the 2 ORFs, through two rounds of PCR (Supple- 
mentary Table 4) using PfuUltra II Ultra Fusion HS (Agilent). 

For lentiviral vector production, 20 million 293T cells were co-trans- 
fected with deltaR8.2 packaging plasmid, VSVG-envelope plasmid, the 
lentiviral vector plasmid of choice and Turbo DNAfectin 3000 (Lamda), 
following manufacturer’s instructions, in Opti-MEM (Life Technologies) 
and incubated for 5-6 h at 37 °C. After incubation for 48 h in Ultraculture
medium (Lonza), supernatant was filtered and concentrated by
ultracentrifugation and pelleted viruses were resuspended in SFEM 
and stored at -80 °C. 


Lentiviral transduction 

Sorted fetal liver and cord blood HSPCs were prestimulated for 24 h in
StemSpan SFEM II supplemented with SCF, FLT3-L, TPO and antibiotic-
antimycotic (LT). Wells were pre-coated with 40 µg ml⁻¹ RetroNectin
(Takara) and seeded with pre-stimulated HSPCs in 300 µl SFEM. Lentivirus
was added twice, first 24 h after cell seeding and again
after an additional 8 h. At 48 h after cell seeding, transduced cells
were washed and seeded in the indicated conditions for each assay.
For shRNA-lentiviral vectors, puromycin (1.0 µg ml⁻¹) treatment was
used for selection of transduced HSPCs and maintained throughout 
culture. Cells infected with overexpression vectors were selected by 
cell sorting for GFP expression. 


Haematopoietic differentiation assays 

FL-HSPCs sorted directly or after 4 weeks of expansion on OP9M2 were
plated in differentiation assays. For the myeloid assay, FL-HSPCs were
plated in StemSpan SFEM II (SCT) supplemented with G-CSF (20 ng
ml⁻¹, Peprotech), GM-CSF (20 ng ml⁻¹, Peprotech), TPO (25 ng ml⁻¹)
and SCF (25 ng ml⁻¹). For the erythroid assay, HSPCs were plated in
StemSpan SFEM II supplemented with IL-3 (20 ng ml⁻¹, Peprotech),
EPO (2.5 IU, Thermo Fisher Scientific), SCF (25 ng ml⁻¹), L-glutamine
and antibiotic-antimycotic. For the megakaryocyte differentiation
assay, HSPCs were plated in IMDM supplemented with recombumin
(0.4%, Albumedix), 2-mercaptoethanol (100 µM, Gibco), SCF (100 ng
ml⁻¹), TPO (50 ng ml⁻¹), IL-3 (10 ng ml⁻¹), IL-6 (7.5 ng ml⁻¹, Peprotech), IL-9
(13.5 ng ml⁻¹, Humanzyme), human low-density lipoprotein (4 µg ml⁻¹,
SCT), ITS-X (Gibco), glutamax and antibiotic-antimycotic. For the T cell
assay, HSPCs were plated on non-irradiated OP9-DLL1 stroma (25,000
cells cm⁻²) in OP9 medium supplemented with SCF (25 ng ml⁻¹), FLT3-L
(10 ng ml⁻¹) and IL-7 (20 ng ml⁻¹, Peprotech). For the B cell assay, HSPCs
were seeded on irradiated OP9M2 in MEM-alpha with 5% FBS supplemented
with SCF (25 ng ml⁻¹), FLT3-L (10 ng ml⁻¹) and IL-7 (20 ng ml⁻¹). In each
assay, cells were cultured in 24-well plates with 1 ml of the indicated
medium and half medium changes were applied every 2-3 days. Cells
were analysed by flow cytometry after 2 weeks.


Cell cycle and apoptosis assays 

For the BrdU incorporation analysis in vitro, control- or MLLT3-OE- 
transduced FL-HSPCs were cultured for 4 weeks on OP9M2 with 500 
nM SR1 and 35 nM UM171 and pulse-labelled with 10 µM BrdU for 35
min in culture. Cells were sorted for the indicated surface phenotypes
and processed according to the PE-BrdU flow kit (BD) instructions and
analysed by flow cytometry. Apoptosis from FL-HSPCs cultured in the
same conditions was assessed by flow cytometry using annexin-V-PE 
(BD) and 7AAD incorporation, following manufacturer instructions. 


Transplantation assays in NSG mice 

In vivo reconstitution ability of cultured HSPCs was assessed in immu- 
nodeficient mice. Female NSG (Jackson Laboratories) mice, 8-12 weeks 
old, were sub-lethally irradiated (2.75 Gy) and retro-orbitally injected 
with cells derived from fetal liver or cord blood in a volume of 100 µl
of RPMI. 

For MLLT3 shRNA knockdown experiments, 30,000 fetal liver CD34+
cells were infected with MLLT3 shRNA or control lentivirus and cultured
on OP9M2 for 9 days under puromycin selection before transplantation.
At 12 weeks, MLLT3-KD mice were euthanized to obtain bone marrow. 

For MLLT3-OE experiments with FL-HSPCs, mice were transplanted 
with the progeny of 1,000 sorted GPI80+ FL-HSPCs (CD34+CD38−/lo
CD90+GPI80+). Cells were transduced with the MLLT3-OE or control
vector and re-sorted after 5 days for GFP+CD34+ cells. These cells were
either injected retro-orbitally immediately after sorting (day 5) or
expanded in SFEM for an additional 10 days (day 15) before injection.
After 24 weeks, blood was collected, and mice were euthanized
to obtain bone marrow and spleen. Collected cells were analysed by
FACS to evaluate human engraftment (human CD45 and absence of
mouse CD45). Reconstituted mice were defined by the presence of GFP+
(MLLT3-OE and control vector) human CD45+ cells that differentiated to
myeloid and lymphoid lineages 24 weeks after transplantation. Differentiation
into myelo-lymphoid lineages was evaluated by the detection
of myeloid (CD14 or CD66b), B-lymphoid (CD19) and T-lymphoid (CD3,
CD4 and CD8) markers on human CD45+ cells. Of note, T-lymphoid
engraftment was evaluated only at 24 weeks from transplantation,
owing to the absence of reliable T-lymphoid differentiation at week 12
(MLLT3-KD, Extended Data Fig. 1k). Preservation of the HSPC compartment
(CD34+CD38−/lo) was also recorded.

For BrdU incorporation analysis in vivo, NSG mice were transplanted 
with high doses of human FL-HSPCs (8,000 GPI80+ HSPCs or their progeny
at culture day 15 after transduction with control or MLLT3-OE vector).
Then 14 weeks after transplantation, 2 mg of BrdU was injected
intraperitoneally. Mice were euthanized 100 min after and bone marrow 
was collected. Cells were sorted for the indicated surface phenotypes 
and processed according to the PE-BrdU flow kit (BD) instructions and 
assessed by flow cytometry. 

For secondary transplantations, viably frozen bone marrow from pri- 
mary NSG mice transplanted with 15-day cultured HSPCs or uncultured 
HSPCs was used. Bone marrow from mice from the same experimental 
group was pooled and a dose equivalent of 1/2 femur of the primary 
mouse total bone marrow was injected retro-orbitally in sub-lethally 
irradiated secondary recipients. Human engraftment was assessed in 
bone marrow after 18 weeks from transplantation. 

For the limiting dilution assay, CB-HSPCs were injected after FACS
sorting, after transduction and sorting of transduced cells (GFP+CD34+,
day 5), or after an additional 10 days of expansion in culture (day 15).
The number of cells reported (2,000, 500, 50, 10) is the number of
sorted HSPCs (uncultured) that was transplanted directly, or the progeny
of which was transplanted after culture (days 5 and 15). The criterion
for multilineage engraftment was the presence of hCD45-positive (GFP+ with
MLLT3 and CTR vector) cells that displayed at least one myeloid marker (CD14,
CD66b or both) and the B-lymphoid marker (CD19). HSC frequency was
assessed using ELDA software.
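
Limiting dilution analysis rests on a single-hit Poisson model, in which a mouse engrafts if it receives at least one repopulating unit. The sketch below illustrates that model with a plain maximum-likelihood estimate of the frequency. It is not the ELDA implementation, which fits a generalized linear model and also reports confidence intervals and group comparisons. The function name and the engraftment counts are hypothetical; only the four cell doses come from the text.

```python
import math

def estimate_frequency(doses, tested, positive, lo=1e-9, hi=1.0, iters=200):
    """Maximum-likelihood repopulating-unit frequency per transplanted cell.

    Assumes the single-hit Poisson model: P(mouse engrafts) = 1 - exp(-f * dose).
    Requires at least one engrafted and one non-engrafted mouse overall.
    """
    def score(f):
        # Derivative of the log-likelihood with respect to f.
        total = 0.0
        for d, n, k in zip(doses, tested, positive):
            p_pos = -math.expm1(-f * d)   # 1 - exp(-f * d), numerically stable
            p_neg = 1.0 - p_pos
            total += k * d * p_neg / p_pos - (n - k) * d
        return total

    # The score decreases monotonically in f, so bisect on a log scale.
    for _ in range(iters):
        mid = math.sqrt(lo * hi)
        if score(mid) > 0:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)

# Hypothetical engraftment counts at the doses named in the text
# (illustrative only, not data from the study).
doses = [2000, 500, 50, 10]
tested = [5, 5, 5, 5]
positive = [5, 4, 1, 0]

f_hat = estimate_frequency(doses, tested, positive)
print(f"Estimated frequency: 1 in {1 / f_hat:.0f} cells")
print(f"Reconstituting units per 1,000 cells: {1000 * f_hat:.2f}")
```

Dividing the estimate for expanded cells by the estimate for uncultured cells gives an expansion factor of the kind reported in Fig. 4h.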

A panel of markers covering the classical AML blast (CD34 and CD33)
and B-cell precursor markers (CD34, CD10, CD20, CD24, CD38) altered
in MLL-MLLT3 fusion gene-driven leukaemia in immunodeficient mice
was tested in mice transplanted with day-15-expanded MLLT3-OE
CB-HSPCs and uncultured CB-HSPCs.

All transplanted mice were included in the analysis unless they died
before the experimental endpoint (in total from combined fetal liver 
and cord blood experiments, 14 out of 115 mice transplanted with 
MLLT3-OE, 17 out of 98 mice transplanted with control vector HSPCs, 
and 3 out of 36 mice with uncultured HSPCs died before 24 weeks). 
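
The exclusion counts above are easier to compare as proportions. This short sketch converts the reported counts into percentages; the dictionary keys are illustrative labels, and no statistical comparison between groups is implied.

```python
# (mice that died before 24 weeks, mice transplanted), as reported in the text.
deaths = {
    "MLLT3-OE": (14, 115),
    "control vector": (17, 98),
    "uncultured": (3, 36),
}

# Fraction of transplanted mice excluded from the analysis in each group.
mortality = {group: died / total for group, (died, total) in deaths.items()}

for group, fraction in mortality.items():
    died, total = deaths[group]
    print(f"{group}: {died}/{total} = {100 * fraction:.1f}% died before 24 weeks")
```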

All studies and procedures involving mice were conducted in compli- 
ance with all the relevant ethical regulations and were approved by the 
UCLA Animal Research Committee (protocol 2005-109). 


ChIP-seq analysis 

Sorted FL-HSPCs or erythroblasts (CD34−CD235+CD71+) (50,000-
100,000 per immunoprecipitation) were crosslinked in 1% formaldehyde
for 10 min, quenched with glycine 0.0125 M and snap-frozen as
a dry pellet. The pellet was re-suspended in lysis buffer (50 mM Tris,
pH 8.2, 10 mM EDTA, 1% Triton X-100, 0.1% sodium deoxycholate,
0.5% sarkosyl) and sonicated for 12 min at a 5% intensity using a Misonix
cup-horn sonicator. Chromatin was incubated overnight with 2 µg of
antibody (anti-MLLT3, Genetex, GTX102835; anti-V5, Abcam, ab15828;
H3K4me3, Abcam ab8580; H3K9ac, CST 9649; H3K9cr, PTM PTM-516;
H3K36me3, Abcam ab9050; H3K79me2, Abcam, ab3594; RNA Pol II
Rbp1 (8WG16) Biolegend 664911; H3K27ac, AM 39133; H3K4me1, AM
39297; H3K27me3, AM 39155; H3K9me3, AM 39161) preloaded onto 20
µl of Protein G Dynabeads (Thermo Fisher) and washed twice with each
of the solutions (low-salt wash, high-salt wash, LiCl wash and TE buffer)
as previously described.

ChIP experiments for H3K79me2 were also performed from short-term
cultures with FL-HSPCs transduced with either MLLT3-KD (no. 93)
or MLLT3-OE vector and respective controls. Knockdown cells were collected
at day 5, which included 3 days of puromycin selection. MLLT3-OE cells
were sorted for GFP and collected at day 10, which included 4 days of
EPZ5676 treatment at 500 nM, or DMSO as a control. Sonicated chromatin
from these ChIPs was spiked with S2 chromatin (Active Motif)
following manufacturer instructions. Libraries were prepared with
the Nugen Ovation Ultralow kit v2 following manufacturer instructions
and sequenced using HiSeq-4000 (Illumina) to obtain single-end
50-bp long reads. Demultiplexing of the reads based on the barcoding
was performed using an in-house Unix shell script. Mapping to the
human genome (hg19) was performed using bowtie2. The Samtools v.1.3.1
package was used to create a .bam file, remove duplicates, blacklisted
regions and chrM regions, sort and index. Bedtools multicov was used
to quantify histone mark signals. GO analysis on genes bound by MLLT3
was calculated using Homer annotatePeaks.pl with the -go function and
the statistic is reported as P value, calculated with standard parameters by
the algorithm. Each H3K79me2, H3K4me3, H3K36me3 and H3K9ac
ChIP replicate on MLLT3-OE (Fig. 3f) is internally normalized using the
non-MLLT3-bound housekeeping gene VCL (Extended Data Fig. 8b). For
H3K79me2 ChIP samples spiked in with S2 chromatin, sequences were
aligned to both the human and Drosophila melanogaster genome (dm6)
and the quantified reads were normalized by total reads aligned to dm6
for each sample (Fig. 3g). Coverage files, average profiles and heat maps
were created with Deeptools packages. MACS2 v.2.1.1 was used to
call MLLT3 peaks using default parameters for broadPeak calling.
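
The two ChIP-seq normalizations described above can be written as simple ratios. The sketch below is illustrative only: the function names, the per-million scaling constant and the read counts are assumptions, not the authors' scripts or data.

```python
def vcl_normalized_ratio(oe_gene, oe_vcl, ctrl_gene, ctrl_vcl):
    """MLLT3-OE over control signal for one gene, after dividing each
    sample's signal by its own signal at the housekeeping gene VCL."""
    return (oe_gene / oe_vcl) / (ctrl_gene / ctrl_vcl)

def spike_in_normalized(human_counts, dm6_total_reads, scale=1_000_000):
    """Scale per-gene human read counts by the total reads aligned to the
    Drosophila (dm6) spike-in genome in the same sample.

    The per-million scale factor is an assumption for readability."""
    return {gene: count * scale / dm6_total_reads
            for gene, count in human_counts.items()}

# Hypothetical read counts, for illustration only.
ratio = vcl_normalized_ratio(oe_gene=300, oe_vcl=100, ctrl_gene=150, ctrl_vcl=100)
normalized = spike_in_normalized({"MECOM": 500, "HLF": 250}, dm6_total_reads=2_000_000)

print(f"VCL-normalized MLLT3-OE/control ratio: {ratio:.2f}")
print(f"Spike-in normalized counts: {normalized}")
```

Normalizing to VCL corrects for differences between samples at an unbound gene, whereas the spike-in approach uses an external reference and so can also capture global changes such as those caused by EPZ5676.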


ATAC-seq analysis 

FL-HSPCs or erythroblasts (CD34−CD235+CD71+) (50,000 sorted cells)
were processed according to the published protocol, with minor adjustments.
Nuclei were purified by the addition of 250 µl of cold lysis buffer (10
mM Tris-HCl, pH 7.4, 10 mM NaCl, 3 mM MgCl₂, 0.1% IGEPAL CA-630) to
sorted cells, pelleted and resuspended in the transposition reaction mix
(Nextera DNA Library Prep Kit, Illumina) and incubated at 37 °C for 30
min. Transposed DNA was column purified and used for library amplification
with custom-made adaptor primers using NEBNext High-
Fidelity 2x PCR Master Mix (New England Labs). The amplification was
interrupted after 5 cycles and a SYBR green qPCR was performed with
1/10 of the sample to estimate for each sample the additional number
of cycles to perform before saturation was achieved. Total amplification
was between 10 and 15 cycles. Purified libraries were sequenced
using HiSeq-2000 (Illumina) to obtain paired-end 50-bp long reads.
Read-mapping to the genome (hg19) was done using Bowtie2 or v.2.2.9
with parameters --local -X 2000 -N 1 --no-mixed. The bamCoverage tool
from Deeptools was used to create the coverage .bw files for visualization.
Samtools v.1.3.1 was used to remove duplicates and reads
aligned to chrM.


RNA-seq analysis 

Total RNA from 50,000 sorted HSPCs was extracted using the RNeasy 
Mini kit (Qiagen) and libraries were constructed using the KAPA RNA 
HyperPrep Kit with RiboErase (HMR). Libraries were sequenced using 
HiSeq-4000 (Illumina) to obtain paired-end 50-bp reads. Mapping to 
the human genome (hg19) was performed using TopHat v.2.0.9 or 
v.2.0.14 with the parameters --no-coverage-search -M -T -x 1. 
Coverage files were created with the genomecov tool from BEDTools with 
the parameters -bg -split -ibam. For abundance estimations (FPKMs), 
the aligned read files were further processed with HOMER coupled to 
edgeR on the hg19 annotation. GO enrichment was calculated using DAVID 
and the statistic is reported as the P value, calculated with standard 
parameters by the algorithm. Gene expression changes were considered 
significantly up- or downregulated when adjusted P < 0.05 (log-transformed 
fold change > 0.322). The selection of differentially expressed genes is based 
on adjusted P value (significant across six replicates) rather than fold 
change because, as MLLT3 is required to maintain HSC identity and 
viability, the changes observed in the HSPC compartment reflect the 
beginning of MLLT3-dependent processes, not the end point. Heat 
maps were generated using Morpheus (Broad Institute) using a colour 
scale normalized by the minimum and maximum value for each gene. 
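
The selection rule and the per-gene colour scaling can be sketched as follows. This is only an illustration under stated assumptions: the fold-change cut-off is applied to the absolute log fold change so that it covers both up- and downregulation, the logarithm is taken to be base 2, and the gene names and values are hypothetical. It does not reproduce the HOMER, edgeR or Morpheus implementations.

```python
def is_differential(adj_p, log_fc, p_cutoff=0.05, lfc_cutoff=0.322):
    """True when a gene passes both thresholds quoted in the text.

    Assumption: the cut-off is applied to |log fold change|.
    """
    return adj_p < p_cutoff and abs(log_fc) > lfc_cutoff


def min_max_scale(values):
    """Rescale one gene's expression values to [0, 1] by its own min and max."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant gene: no contrast to display
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical genes: (adjusted P value, log fold change).
genes = {
    "GENE_A": (0.01, 0.5),
    "GENE_B": (0.2, 1.0),     # large change, but not significant
    "GENE_C": (0.001, -0.4),  # significant downregulation
    "GENE_D": (0.01, 0.1),    # significant, but below the fold-change cut-off
}
selected = sorted(name for name, (p, lfc) in genes.items() if is_differential(p, lfc))
scaled = min_max_scale([2.0, 4.0, 6.0])
```

Scaling each gene by its own minimum and maximum makes the heat map show relative changes within a gene, so highly and lowly expressed genes remain comparable on one colour scale.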


Reporting summary 


Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
Extended Data Fig. 1 | MLLT3 expression in human haematopoietic tissues 
and after shRNA knockdown. a, Candidate human HSC self-renewal factors 
that correlate with the self-renewal of human FL-HSCs (Fig. 1a) and are localized 
in the nucleus are listed. b, Bar graphs showing mean and individual probe 
values for MLLT3 from microarray datasets after differentiation of 
CD34+CD38−/lowCD90+GPI80+ FL-HSCs to CD34+CD38−/lowCD90+GPI80− 
progenitors, in FL-HSPCs after 5-week culture on OP9M2 stroma, and in hES 
cell-HSPCs (n = 3; n = 2 for 5-week culture and hES cell-HSPCs). c, Quantitative 
PCR with reverse transcription (qRT-PCR) validation of MLLT3 expression in 
human fetal liver, cord blood and bone marrow HSPCs and downstream 
progenitors (n = 2 donors per tissue). Expression is relative to GAPDH, 
normalized to the first sample of each plot. Mean and individual values are 
shown. d, UCSC genome browser tracks of RNA-seq analysis of sorted human 
developmental haematopoietic populations. First trimester (5 weeks) 
embryonic aorta-gonad-mesonephros (AGM) endothelium 
(CD34+CD90+CD43−) and HSPC (CD34+CD90+CD43+) population from one 
donor are compared to second trimester (17-week) FL-HSPCs (CD34+CD38−/low 
CD90+CD45+). Expression of transcription factors involved in the development 
of haemogenic endothelium and HSC specification are compared to MLLT3. e, 
f, Flow cytometry analysis (e) and quantification (f) of GFP fluorescence in 
human K562 cells (haematopoietic cell line with no detectable MLLT3 
expression) stably expressing MLLT3-OE vector (as in Extended Data Fig. 2a), 
that were transduced with five different shRNAs targeting MLLT3 (sh_90-94), 
representative of two transductions. shRNA-mediated knockdown targeting 
MLLT3 also lowers GFP that is transcribed from a single GFP-P2A-MLLT3 
transcript. Median fluorescence intensity (MFI) from n = 2 transductions (mean 
and individual values). g, Strategy for MLLT3 lentiviral knockdown and 
functional analysis in FL-HSPCs. h, FACS plots 5 and 15 days after MLLT3-KD 
(representative of three experiments). i, j, MLLT3 qRT-PCR 5 days after 
transduction (i) and quantification of CD34+CD38−/low (j) in cells transduced 
with empty vector or MLLT3 shRNA (KD92 and KD93) after 5 and 15 days in 
culture (mean and individual values, n = 3 donors). k, Representative FACS plots 
from NSG mouse bone marrow 12 weeks after transplantation assessing human 
CD45+ cells, multi-lineage haematopoietic reconstitution and HSPCs 
(extended from Fig. 1d, n = 10 mice, 2 independent experiments). 
Extended Data Fig. 2 | Effects of MLLT3-OE on HSPC expansion. a, FUGW 
lentiviral construct expressing MLLT3 and GFP. P2A, cleavage sequence; UbC, 
ubiquitin C promoter; V5, V5 peptide tag. Representative flow cytometry plots 
showing persistent GFP expression in FL-HSPCs after transduction with control 
or MLLT3-OE vector, representative of six experiments. b, qRT-PCR 
quantifying MLLT3-OE in transduced HSPCs after one week (mean, n = 3). c, The 
effects of MLLT3-OE in GPI80+ (red) and GPI80− (blue) FL-HSPC progenitors. 
d, Representative FACS plots derived from GPI80+ HSPCs (top) and GPI80− 
progenitors (bottom) expressing control and MLLT3-OE vectors. e, Expansion 
of total live cells (left) and HSPCs (right) expressing control or MLLT3-OE 
vectors (mean and s.e.m., n = 3). f, Strategy for lentiviral overexpression of 
MLLT3 in FL-HSPCs seeded with or without OP9M2 stroma to assess their 
dependence on a supportive microenvironment. g, Representative FACS plots 
of FUGW empty-vector-transduced and MLLT3-OE-transduced GPI80+ HSPCs 
seeded on OP9M2 (top) or stroma-free culture wells. h, Expansion of total live 
cells (left) and CD34+CD38−/lowCD90+ HSPCs (right) transduced with control or 
MLLT3-OE vectors (mean and s.e.m., n = 3 independent experiments). Culture 
of MLLT3-OE HSPCs with small molecules UM171 and SR1. i, Expansion of total 
live cells (top) and CD34+CD38−/lowCD90+ HSPCs (bottom) transduced with 
control or MLLT3-OE vectors (mean and s.e.m., n = 2 independent 
experiments), cultured in SFEM with SR1 and UM171. j, Expansion of total live 
cells (top) and CD34+CD38−/lowCD90+ HSPCs (bottom) transduced with control 
or MLLT3-OE vectors (mean and s.e.m., n = 3 independent experiments), 
cultured in HSC medium on OP9M2 with SR1 and UM171. P values in e, h and j 
determined by two-sided t-test. 
Extended Data Fig. 3 | Effects of MLLT3-OE on cell cycle, viability and 
differentiation in culture. a, Strategy to test cell-cycle activity, apoptosis and 
differentiation potential of MLLT3-OE-expanded HSPCs. b, BrdU incorporation 
flow cytometry analysis of FL-HSPCs cultured for 4 weeks on OP9M2 with SR1 
and UM171. G0/G1 (BrdU−7AAD"), S (BrdU+7AAD™) and G2/M (BrdU−7AAD") 
phases are quantified in FL-HSPCs (mean and s.e.m., n = 5). c, Apoptosis analysis 
of HSPCs cultured for 4 weeks on OP9M2 with SR1 and UM171. Early 
(AnnV+7AAD−) and late (AnnV+7AAD+) apoptosis are quantified by flow 
cytometry (mean and s.e.m., n = 2 uncultured, n = 4 control and MLLT3-OE). 
d, FACS analysis of in vitro differentiation (representative of three 
experiments). e, Quantification of the differentiation of 4-week expanded 
HSPCs into myeloid, erythroid, megakaryocytic, B- and T-lymphoid lineages, 
compared with the differentiation of uncultured FL-HSPCs (mean, n = 3 donors 
except for megakaryocytic n = 4 and B-lymphoid uncultured n = 2). All P values 
determined by two-sided t-test. 
Extended Data Fig. 4 | Analysis of human reconstitution and correlation of 
cell cycle and culture time with the engraftment of MLLT3-OE cells. 
a, Transplantation strategy with ex vivo expanded FL-HSPCs transduced with 
control or MLLT3-OE vector. On day 5, transduced CD34+GFP+ cells derived 
from 1,000 HSPCs were sorted and replated in SFEM and transplanted into NSG 
mice at day 15. Human haematopoietic reconstitution was assessed at 
24 weeks. Bone marrow from engrafted mice was transplanted to secondary 
recipients. b, FACS analysis showing human haematopoietic reconstitution 
(hCD45) (Fig. 1h) and multilineage differentiation (myeloid CD66, CD14; 
B-lymphoid CD19; T-lymphoid CD3, CD4, CD8). c, d, Representative FACS plots 
(c) and quantification (d) of human haematopoietic reconstitution from 
peripheral blood of transplanted NSG mice (extended from Fig. 1h). 
e, Quantification of human haematopoietic reconstitution from mouse spleen. 
Number of mice showing multi-lineage reconstitution versus total 
transplanted mice is shown (n = 13 or 14 mice, mean and individual values from 
four independent experiments). f, FACS sorting strategy for cell-cycle analysis 
after 100 min in vivo BrdU pulse in haematopoietic cells derived from human 
FL-HSPCs (8,000 GPI80+ FL-HSPCs or their progeny day 15 of culture after 
transduction with control or MLLT3-OE vector) transplanted in NSG mice and 
sorted 14 weeks after transplantation. g, h, Representative FACS plots (g) and 
quantification (h) of cell-cycle distribution by BrdU and DNA content (7AAD) 
staining. G0/G1 (BrdU−7AAD"), S (BrdU+7AAD™) and G2/M (BrdU−7AAD") 
phases are quantified in HSPCs, CD66b+ myeloid and CD19+ B-lymphoid cells 
(mean, n = 2). i, j, Cell-cycle analysis in HSPCs before transplantation, after 
2-week culture in SFEM with SR1 and UM171. All P values determined by 
two-sided t-test. k, Transplantation strategy assessing the effect of extended 
culture on the reconstitution ability of MLLT3-OE FL-HSPCs. FL-HSPCs 
transduced with control or MLLT3-OE vector were sorted on day 5, and 
CD34+GFP+ cells derived from 1,000 uncultured HSPCs were transplanted 
directly into NSG mice or cultured in SFEM for an additional 10 days before 
transplantation. Numbers show mice with multilineage human engraftment, 
transplanted before (day 5) and after (day 15) culture expansion. l, m, Human 
CD45+ cells (l) and human HSPCs (m) in mouse bone marrow are quantified 
(n = 4 independent experiments; number of mice is indicated). *P < 0.05, 
two-sided t-test. n, o, Representative FACS plots (n) and quantification (o) of human 
haematopoietic (CD45) reconstitution and multilineage differentiation in 
mouse bone marrow after 18 weeks of secondary transplantation (mean, two 
independent experiments). Secondary mice were transplanted with 2-week-old 
bone marrow from primary recipients (1/2 of femur of the primary transplanted 
mice injected per recipient). 
Extended Data Fig. 5 | MLLT3 binding and correlation with epigenetic 
marks. a, Heat map showing k-means clustering for MLLT3 ChIP-seq peaks and 
their association with ATAC-seq and histone marks in uncultured FL-HSPCs. 
ChIP of MLLT3 in K562 cells is shown as a control. The region surrounding the 
MLLT3 peak centre ±5 kb is shown. Peak distance from the gene TSSs and 
representative high-ranked biological processes enriched in GREAT analysis 
are shown for each cluster. Cluster 6 genes, in which the MLLT3 signal does not 
correlate with active marks, show no significant enrichment for biological 
processes. b, Average profile and heat map for MLLT3 and indicated histone 
marks in expressed and non-expressed genes in uncultured FL-HSPCs. 
Metagene plot ±2 kb is shown. This represents an extended version of Fig. 2c. c, 
Heat map for selected gene groups as in b. This represents an extended version 
of Fig. 2e. Tracks in a-c are representative of n = 3 experiments for MLLT3 ChIP 
in FL-HSPCs, n = 1 for H3K9cr, H3K27me3 and H3K9me3, and n = 2 for other ChIP 
experiments. 
Extended Data Fig. 6 | Cell-type specificity of MLLT3 binding. a, UCSC 
genome browser tracks showing ChIP-seq of MLLT3 and epigenetic marks in 
representative MLLT3-bound genes in FL-HSPCs and FL-erythroblasts (n = 3 
MLLT3 ChIP in FL-HSPCs, n = 1 for H3K9cr, H3K27me3 and H3K9me3, and n = 2 
for other ChIP experiments). This represents an extended version of Fig. 2g. b, 
Heat map with hierarchical clustering for genes bound by MLLT3 in FL-HSPCs 
(n = 2) and/or CD34−GlyA+CD71+ FL-erythroblasts (n = 2) (genes with significant 
MLLT3 peak <5 kb from TSS) showing clusters of common and 
cell-type-specific MLLT3-bound genes. c, Distribution of MLLT3 FL-erythroblast peaks. 
d, GO analysis of MLLT3-bound genes in FL-erythroblasts. 
Extended Data Fig. 7 | MLLT3 binding and regulation of MLLT3 target genes 
in cultured HSPCs. a, Heat map showing distribution of MLLT3 peaks 
reflecting endogenous MLLT3 and MLLT3-OE ChIP-seq signal using MLLT3 and 
V5 antibodies. MLLT3 from uncultured FL-HSPCs and input control, MLLT3 
ChIP signal from 4-week cultured FL-HSPCs with MLLT3-OE or empty control 
vector, and V5 ChIP from FL-HSPCs cultured for 4 weeks with MLLT3-OE or 
control vector. The region surrounding the peak summit (±2 kb) is shown for 
the combined MLLT3 peaks from all conditions (FL-HSPC MLLT3 ChIP n = 3, 
MLLT3-OE V5 ChIP n = 2 and MLLT3-OE MLLT3 ChIP n = 1). IP, 
immunoprecipitate. b, Heat map showing hierarchical clustering for all 
MLLT3-bound genes as in a. Signal from all MLLT3-bound genes (significant MLLT3 
peak at <5 kb from TSS) is shown. c, UCSC genome browser tracks of 
representative MLLT3-bound HSC (yellow) and erythroid (red) genes. Tracks 
show ChIP-seq signal for input and MLLT3 in uncultured FL-HSPCs, MLLT3 
ChIP-seq on FL-HSPCs cultured for 4 weeks with control and MLLT3-OE 
vectors; V5-tagged MLLT3 ChIP-seq from FL-HSPCs cultured for 4 weeks with 
control and MLLT3-OE vectors, and MLLT3 ChIP-seq in FL-erythroblasts. 
d, Probe values from microarray datasets for MLLT3 and MLLT3-bound and 
upregulated HSC genes (see Fig. 3) documenting the decline of gene 
expression over time in culture (day 0 (n = 3), 2 weeks (n = 3) and 5 weeks (n = 2) 
on OP9M2, mean and individual values). e-g, Knockdown of MLLT3 targets 
MECOM and HLF in MLLT3-OE cells. e, FACS plots at 15 and 30 days after 
MECOM-KD and HLF-KD, with or without MLLT3-OE (representative of three 
experiments). f, Quantification of CD34+CD38−/lowCD90+ HSPCs (percentage of 
live cells) in MECOM-KD and HLF-KD FL-HSPCs transduced with empty vector 
or MLLT3-OE after 5, 15, 30 and 42 days in culture (mean, n = 3). P values 
determined by two-sided t-test. g, Quantification of HLF downregulation in 
HepG2 cells by qRT-PCR. h, Quantification of MECOM downregulation in K562 
cells by qRT-PCR. 
Extended Data Fig. 8 | Examples of MLLT3-mediated gene regulation after 
MLLT3-OE, MLLT3-KD and DOT1L inhibition. a, UCSC genome browser tracks 
of representative MLLT3-OE upregulated or downregulated genes. From top to 
bottom: MLLT3 ChIP-seq signal in FL-HSPCs cultured for 4 weeks with empty 
vector or MLLT3-OE; RNA-seq tracks; ChIP-seq for RNA Pol II and epigenetic 
marks in cultured control and MLLT3-OE FL-HSPC. Tracks are generated from 
the merged signal from n = 5 H3K79me2 ChIP, n = 2 other ChIPs. b, UCSC 
genome browser tracks of the indicated ChIP experiments in representative 
genes in MLLT3-KD and control vector after 5-day culture, MLLT3-OE, 
MLLT3-OE plus EPZ5676 (DOT1L inhibitor), or control plus EPZ5676 and control vector 
after 10-day culture, and MLLT3-OE versus control from 4-week culture. Genes 
from Fig. 3g are indicated in bold. The genomic region adjacent to MSI2 is 
included to show genes not bound by MLLT3 (that is, PCPT and VCL). 



Extended Data Fig. 9 | Effects of MLLT3 knockdown and overexpression on 
CB-HSPC expansion in vitro. a, FACS analysis, 5 and 15 days after MLLT3-KD 
in CB-HSPCs (representative of two experiments). b, Quantification of 
haematopoietic subsets CD34+CD38−/low and CD34+CD38−/lowCD90+ (HSPCs) in 
empty vector (CTR) and MLLT3-KD-transduced cells after 5, 15 and 30 days in 
culture (mean, n = 2). Circles represent individual values. c, FACS analysis of 
CB-HSPCs transduced with control or MLLT3-OE vectors cultured on OP9M2 
(representative of two experiments). d, Expansion of total live cells (left) and 
CD34+CD38−/lowCD90+ HSPCs (right) transduced with MLLT3-OE or control 
vectors. 


Extended Data Fig. 10 | Comparison of the function of MLLT3-OE cells and 
uncultured cells in transplanted mice. a, FACS analysis of total human 
haematopoietic reconstitution in peripheral blood from mice transplanted 
with uncultured CB-HSPCs (CD34+CD38−CD90+) or CB-HSPCs transduced with 
empty GFP vector (CTR) or MLLT3-OE vector after 15 days of culture, as in Fig. 4f 
(representative of three experiments). Plots show human CD45 
haematopoietic reconstitution and multilineage differentiation (myeloid 
CD66, CD14; B-lymphoid CD19; T-lymphoid CD3, CD4, CD8), and GFP in hCD45+ 
cells. b, Quantification of monocytes (CD14+), granulocytes (CD66b+), B cells 
(CD14−CD66b−CD19+) and T cells (CD14−CD66b−CD4+ and/or CD8+) as a 
percentage of human CD45+ cells in engrafted NSG mice 24 weeks after 
transplantation (mean, n = 3 experiments). P values determined by two-sided 
t-test. Quantification of human haematopoietic reconstitution in the blood of 
NSG mice 24 weeks after transplantation in limiting dilution, as in Fig. 4f. 
Percentage of hCD45+ cells in peripheral blood (c) and spleen (d) (mean, n = 3 
experiments). e, f, FACS plots (e) and quantification (f) of total human CD45+ 
haematopoietic reconstitution after 18 weeks of secondary transplantation of 
cord blood (uncultured, day-15-expanded control and MLLT3-OE HSPCs) in 
mouse bone marrow (mean, n = 2 experiments). g, h, Quantification of human 
reconstitution in engrafted mice after 24 weeks of primary transplantation. 
A panel of markers covering the classical AML blast (CD34 and CD33) and B-cell 
precursor markers (CD34, CD10, CD20, CD24, CD38) were tested in day 
15-expanded MLLT3-OE CB-HSPCs, comparing their distribution to mice 
engrafted with uncultured CB-HSPCs (n = 5 mice from two independent 
experiments, mean and individual values). Haematopoietic markers included 
CD33+ myeloid, CD19+ lymphoid, and B-cell precursor markers CD34+, CD10+, 
CD20+, CD24+ and CD38+ within the CD19+ population. 
Life sciences study design 
Sample size For each experiment, a sample size was chosen to obtain sufficient number of experiments and samples (different human donor tissues) to 
Data exclusions No collected data were excluded from the analysis. 
Replication Data presented were replicated in several independent experiments and by different investigators. 
Randomization Allocation of samples or mice to the experimental groups was random. 


Blinding Investigators were not blinded. Data collection for all experiments was automated (e.g. flow cytometry, sequencing etc.) and data 
interpretation was based on appropriate controls rather than subjective assessment by investigators. 
Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
|] Palaeontology [| MRI-based neuroimaging 
Animals and other organisms 
Human research participants 
Clinical data 
Antibodies 
Antibodies used human-CD45-BV711 or -BV785 cl. HI30 (304050, 304048; Biolegend, 1:100), mouse-CD45-APC-H7 cl. 30-F11 (557659; BD, 1:100), 
CD34-APC cl. 581 (555824; BD, 1:20) or -BV605 (343529; Biolegend, 1:20), CD90-FITC cl. 5E10 (555595; BD, 1:100) or -APC 
(555595; BD, 1:100), CD38-PE-Cy7 or -BUV496 cl. HIT2 (560677 and 564657; BD, 1:100), CD19-PE or BV605 cl. 1D3 or HIB19 
(12-0193, 12-0199; eBiosciences, 1:50), GPI-80-PE cl. 3H9 (DO87-5; MBL, used at 1:50), CD3-PE-Cy7 cl. SK7 (557851, BD) 
eBiosciences, 1:50), CD4-APC (MHCD0405; Invitrogen, 1:50), CD8-PE cl. HIT8A (555635; BD, 1:50), CD13-APC cl. WM15 (557454; 
BD, 1:50), CD66b-BV421 cl. G10F5 (555724; BD, 1:50), CD14-V500 cl. WM53 (561816; BD, 1:100), CD235a-PE or -APC 
(Glycophorin A, HIR2, BD 1:100), CD71-AF647 (DF1513, Santa Cruz 1:100), CD41a-APC cl. HIP8 (579777; BD, 1:20), CD42b-PEcy7 
cl. HIP1 (303916; Biolegend, 1:100), AnnexinV-PE (556422; BD, 1:50), BrdU-PE (556029; BD, 1:100), CD10-AF700 cl. HI10a 
(563509; BD, 1:100), CD33-PE cl. WM53 (555450; BD, 1:50), CD24-BV711 cl. MLS (563401; BD, 1:100), CD20-BV650 cl. 2H7 
(563780; BD, 1:100), anti-MLLT3, Genetex, GTX102835; anti-V5, Abcam, ab15828; H3K4me3, Abcam ab8580; H3K9ac, CST 
#9649; H3K9cr, PTM #PTM-516; H3K36me3, Abcam ab9050; H3K79me2, Abcam, ab3594; RNA Pol II Rbp1 (8WG16) Biolegend 
664911, H3K27ac, AM #39133; H3K4me1, AM #39297; H3K27me3, AM #39155; H3K9me3, AM #39161; anti-CD34 microbeads 
(Miltenyi Biotech 130-100-453) 
Validation MLLT3 antibody was validated in ChIP by using negative (K562, not expressing MLLT3) and positive (MLLT3-OE, overexpressing 
MLLT3) cells, as shown in the manuscript. We provide the catalog number, manufacturer and clone (where applicable) for all 40 
FACS and ChIP antibodies listed above. All the available information on antibody validation can be found by searching the 
manufacturer’s websites. 


Eukaryotic cell lines 
Policy information about cell lines 


Cell line source(s) K562, HepG2, 293T (from ATCC). OP9M2, subcloned in our lab from OP9 (see Magnusson, et al Plos One 2013) 


Authentication OP9M2, was extensively authenticated (Magnusson, et al Plos One 2013). The rest of the lines, used only for knockdown 
validation (K562, HepG2) and virus production (293T), were not validated. 


Mycoplasma contamination Not Tested 


Commonly misidentified lines No commonly misidentified cell lines were used 
(See ICLAC register) 
Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals For human HSC transplantation, 8-12-week-old females from the NSG (NOD.Cg-Prkdcscid Il2rgtm1Wjl/SzJ) strain were used 

Wild animals No wild animals were used in these studies 

Field-collected samples No field collected samples were used in these studies 

Ethics oversight All experiments were carried out in accordance with ethical care guidelines set by University of California on Laboratory Animal 


Care. Specific protocol numbers available on request. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
Genome browser session No longer applicable 
(e.g. UCSC) 


Methodology 


Replicates All ChIPs performed to study the MLLT3-OE effect on FL-HSPC were repeated at least 2 times (ChIP-seq K79 MLLT3-OE vs CTR 
n = 5). Some genomic distribution data of histone marks on uncultured HSPC (Fig. 3c, e) were obtained only once and 
checked against deposited data on similar cell types. 
Sequencing depth RNAseq libraries were multiplexed in 4-5 samples per lane of a Nextseq run in PE75 
ChIPseq libraries were multiplexed in 8-10 samples per lane of a Hiseq4000 run in SE50 
ATACseq libraries were multiplexed in 4-6 samples per lane of a Hiseq2500 run in PE50 
Sequencing depth for samples in Fig. 5 expressed as uniquely aligned reads (or pairs for RNAseq) to hg19 genome: 
Assay Sample Exp number UMR 
Aseq Uncult. ex1 17393805 
Aseq Uncult. ex2 19671545 
Aseq Uncult. ex3 15297822 
Aseq CTR ex1 16996453 
Aseq CTR ex2 16227199 
Aseq CTR ex3 16653389 
Aseq CTR ex4 14875982 
Aseq CTR ex5 16022515 
Aseq CTR ex6 15218252 
Aseq MLLT3-OE ex1 16798850 
Aseq MLLT3-OE ex2 17009253 
Methodology 
Sample preparation FACS analysis was performed using single cell suspensions prepared from human fetal liver and cord blood after isolation from 
the tissues or culture, or obtained from NSG mice transplanted with human cells. Cells were stained in PBS with 5% FBS. 
Instrument Cells were assayed on a BD-LSRII flow cytometer. Cell sorting was performed using a BD FACS Aria II. 
Software BD DIVA v8 


FlowJo software (Tree Star Inc.) 
Cell population abundance Cell purity was checked periodically when setting up the sorting instruments, resulting in >95 % purity 
Gating strategy For both flow cytometry analysis and sorting, cells were selected for live cell scatter in FSC/SSC then for singlets in FSCa/FSCh or 


FSCh/FSCw and SSCh/SSCw then for 7AAD negativity (dead cell exclusion). Other markers were determined positive when signal 
was above FMO (fluorescence minus one) control. 
Mammals form scars to quickly seal wounds and ensure survival by an incompletely 
understood mechanism. Here we show that skin scars originate from prefabricated 
matrix in the subcutaneous fascia. Fate mapping and live imaging revealed that fascia 
fibroblasts rise to the skin surface after wounding, dragging their surrounding 
extracellular jelly-like matrix, including embedded blood vessels, macrophages and 
peripheral nerves, to form the provisional matrix. Genetic ablation of fascia 
fibroblasts prevented matrix from homing into wounds and resulted in defective 
scars, whereas placing an impermeable film beneath the skin (preventing fascia 
fibroblasts from migrating upwards) led to chronic open wounds. Thus, fascia 
contains a specialized prefabricated kit of sentry fibroblasts, embedded within a 
movable sealant, that preassemble together diverse cell types and matrix 
components needed to heal wounds. Our findings suggest that chronic and excessive 
skin wounds may be attributed to the mobility of the fascia matrix. 


Mammalian scarring occurs when specialized fibroblasts migrate into 
wounds to deposit plugs of extracellular matrix. Abnormal scarring 
results in either non-healing chronic wounds or aggravating fibrosis, 
which represent a high burden for patients and for the global healthcare 
system. In the US alone, costs related to impaired scarring amount to 
tens of billions of dollars per year. 

The origin of fibroblasts in wounds and the mechanism by which they 
act remain unclear. Possible sources, such as papillary (upper) and 
reticular (lower) dermal layers, pericytes, adipocytes and bone-marrow-derived 
monocytes, have been suggested. We previously demonstrated 
that all scars in the back skin derive from a distinct fibroblast lineage 
expressing the engrailed 1 gene (En1) during embryogenesis and we refer 
to these cells as En1-lineage positive fibroblasts (EPFs). This lineage is 
present not only in the skin but also in the strata underneath, called fascia. 

The fascia is a gelatinous viscoelastic membranous sheet that creates 
a frictionless gliding interface between the skin and the body's rigid 
structure below. Mouse back-skin fascia extends as a single sheet 
separated from the skin by the Panniculus carnosus (PC) muscle, whereas in 
human back-skin there is no intervening muscle and the fascia consists 
of several thicker sheets that are continuous with the skin. In humans 
the fascia layers incorporate fibroblasts, lymphatics, adipose tissue, 
neurovascular sheets and sensory neurons. 

Here we explored the fundamental mechanisms of scar formation by 
using matrix-tracing techniques, live imaging, genetic-lineage tracing 
and anatomic fate-mapping models. We identified the fascia as a major 
source of wound-native cells, including fibroblasts. Notably, we found 
that wound provisional matrix originates from prefabricated matrix in 
the fascia that homes in to open wounds as a movable sealant dragging 
along vasculature, immune cells and nerves, upwards into the skin. 


Wound cells rise from fascia 


To trace the origins of cells in wounds, we developed a fate-mapping 
technique by transplanting chimeric skin and fascia grafts into living 
animals (Fig. 1a and Methods). 

At 14 days post-wounding (dpw), 80.04 ± 3.443% (mean ± s.e.m.) of 
the labelled cells in the wound originated from fascia (Fig. 1b). 
Fascia-derived cells filled the entire wound bordering the regenerated 
epidermis and even the surrounding dermis, making up 35.46 ± 4.938% 
of the total labelled cells within a 0.2 mm radius (Fig. 1b, c). ACTA2+ 
myofibroblasts (81.63 ± 12.84%), nerve cells, endothelial cells and 
macrophages within wounds were predominantly of fascia origin 
(Fig. 1d, e). Independently, in vivo labelling of the fascia showed the same 
results (Extended Data Fig. 1a, Methods). Labelled cells populated 
the wounds and surrounding dermis at 14 dpw, whereas in uninjured 
controls, labelled cells remained in the fascia (Extended Data Fig. 1b). 
Up to 56.71 ± 9.319% of fascia-derived cells in wounds expressed 
classical fibroblast markers (Extended Data Fig. 1c). Labelled monocytes 
(macrophages), lymphatics, endothelium and nerves also derived 
from fascia (Extended Data Fig. 1d). Collectively, our two independent 
fate-mapping approaches demonstrate that fascia is a major reservoir 
of fibroblasts, endothelial cells, macrophages and peripheral nerves 
that populate wounds following injury. 


Fascia fibroblasts dictate scar severity 


We then analysed the scar-forming EPFs across dermal and fascia 
compartments by using a TdTomato-to-GFP replacement reporter 
(En1Cre;R26mTmG; see Methods). Fibroblasts were the predominant fascia 
cell type (71.1%), whereas dermis had a significantly lower fraction of 


1Group Regenerative Biology and Medicine, Institute of Lung Biology and Disease, Helmholtz Zentrum München, Munich, Germany. 2Research Unit Analytical Pathology, Helmholtz Zentrum 
München, Munich, Germany. 3Mira-Beau Gender Esthetics, Berlin, Germany. 4Wound Repair Unit, CBIB, Department of Biology and Biochemistry, University of Bremen, Bremen, Germany. 
5Department of Dermatology and Allergology, School of Medicine, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany. 6German Centre for Lung Research (DZL), 
Munich, Germany. 7These authors contributed equally: Donovan Correa-Gallegos, Dongsheng Jiang. *e-mail: yuval.rinkevich@helmholtz-muenchen.de 


Nature | Vol 576 | 12 December 2019 | 287




[Fig. 1 graphic. Panels compare Fascia (GFP) and Skin (TdTomato); recoverable panel titles: Endothelial; Monocytes and macrophages. Recoverable P values: P = 0.0021, P = 0.0001.]


Fig. 1 | Fascia is a major cellular source for wounds. a, Generation of 6-mm
diameter chimeric grafts with an inner 2-mm wound to determine
contributions of dermis and fascia. b, Percentages of TdTomato+ or GFP+ cells
from total labelled cells in the wound and wound margin. Data are
mean ± s.e.m.; n = 26 images from 4 biological replicates. One-way ANOVA,
multiple comparison Tukey’s test, 95% confidence interval (CI). c, Wound
showing skin- and fascia-derived cells at 14 dpw. d, e, Immunolabelling and
contributions of myofibroblasts (ACTA2), nerves (TUBB3), blood vessels
(PECAM1), monocytes or macrophages (MOMA-2) and lymphatic cells (LYVE1).
Data are mean ± s.e.m.; n = 4 (LYVE1), 5 (ACTA2, TUBB3 and MOMA-2) or 8
(PECAM1) images from 4 biological replicates. Unpaired two-tailed t-test, 95%
CI. Dotted lines delimit the wound. Arrowheads indicate injury site. Scale bars,
200 µm. TdTom, TdTomato.


fibroblasts (56.4%, Extended Data Fig. 2a, b). Within this population,
there were twice as many EPFs as En1-naive fibroblasts (ENFs) in the
fascia (61.2% and 31.8%, respectively). In dermis, there was a sixfold excess
of EPFs (83.13% EPFs versus 12.78% ENFs; Extended Data Fig. 2c, d).
Fascia was also enriched in regenerative cell types such as endothelial
cells and lymphatics, whereas populations of macrophages and nerve
cells were similar in both compartments (Extended Data Fig. 2e). Thus,
a higher proportion of fibroblasts, endothelial cells and lymphatic
cells and a lower EPF to ENF ratio distinguish the fascia from dermis.
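
The 'twice as many' and 'sixfold' statements follow directly from the reported percentages. A short worked check, using only the four percentages quoted above (the function name is an illustrative choice):

```python
def fold_excess(epf_percent, enf_percent):
    """Ratio of EPFs to ENFs within the fibroblast population."""
    if enf_percent <= 0:
        raise ValueError("ENF percentage must be positive")
    return epf_percent / enf_percent


# Percentages quoted in the text for each compartment.
fascia_ratio = fold_excess(61.2, 31.8)    # roughly twofold
dermis_ratio = fold_excess(83.13, 12.78)  # roughly sixfold

print(f"fascia EPF:ENF = {fascia_ratio:.2f}, dermis EPF:ENF = {dermis_ratio:.2f}")
```

The dermal ratio is about three times the fascia ratio, which is the sense in which fascia has the lower EPF to ENF ratio.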

Two-photon microscopy revealed that fascia EPFs assemble in mon- 
olayers of consecutive perpendicular sheets across the dorsal-ventral 
axis (Extended Data Fig. 2f, Supplementary Video 1). EPFs populate 
the entire back in topographic continua extending from the fascia 
and traversing the PC (Extended Data Fig. 2g, h, Supplementary Video
2). Regions where PC ended or where nerve bundles and blood vessels
traversed the PC also showed continuums of EPFs without clear
boundaries (Extended Data Fig. 2i, j). To test whether fascia EPFs could
access dermal layers upon injury, we generated superficial excisional 
wounds. Aggregates of EPFs rising into open wounds from fissures in 
the PC were observed after only 3 dpw (Extended Data Fig. 2k, Sup- 
plementary Video 3). Collectively, our observations suggest that fascia 
EPFs easily traverse upwards into dermal layers during wounding and 
are unobstructed by the PC muscle. 

Fig. 2 | Fascia EPFs dictate scar severity. a, Dermal versus fascia EPFs chimeric
grafts. Chimeric grafts were prepared as in Fig. 1a, using the EPF reporter line
En1Cre;R26mTmG combined with Cre-negative littermate samples to trace EPFs in
each compartment. b, Images showing fascia EPFs (left) or dermal EPFs (right)
in wounds after a deep (top) or superficial injury (bottom). c, Wound size. Data
are mean ± s.e.m.; n = 70 (deep) and 53 (superficial) images analysed from 5
biological replicates. Unpaired, two-tailed t-test, 95% CI. d, Numbers of fascia
and dermal EPFs. Data are mean ± s.e.m.; n = 27, 32, 27 and 22 images analysed
from 5 biological replicates. Unpaired, two-tailed t-test, 95% CI. e, f, Plots of EPF
fractions and wound size from fascia (e) and dermal EPFs (f). n = 57 (e) and 48 (f)
images analysed from 5 biological replicates. Two-tailed Pearson’s R²
correlation, 95% CI. Dotted lines delimit wounds. Scale bars, 200 µm.


The size of a scar increases with the depth of the injury. We therefore
investigated whether this correlation can be attributed to fascia
by analysing the extent of fibroblast contributions from the fascia
and dermis in deep versus superficial wounds. For this, we combined
genetic lineage-tracing (En1Cre;R26mTmG) with our anatomic fate-mapping
chimeric grafts and inflicted superficial or deep injuries (Fig. 2a,
Methods). Fourteen days post-wounding, mean wound size of deep injuries
was 1.7 times that of superficial injuries (Fig. 2b, c). There were twice 
as many fascia EPFs in deep wounds, whereas the number of dermal 
EPFs remained constant in both conditions (Fig. 2d). The abundance of 
fascia EPFs in the wound directly correlated with wound size and thus 
scar severity, whereas dermal EPFs showed no such correlation (Fig. 2e, 
f). No crossing of EPFs between these compartments was observed in 
uninjured controls, indicating that the influx of fascia EPFs was trig- 
gered by injury (Extended Data Fig. 3a, b). 
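
The relationship between EPF abundance and wound size is assessed in the Fig. 2 legend with a Pearson correlation reported as R². A minimal sketch of that statistic is given below. The paired values are hypothetical and are not data from the study, and the two-tailed P value mentioned in the legend is not computed here.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical per-image pairs: EPF fraction in the wound and normalized wound size.
epf_fraction = [0.1, 0.2, 0.3, 0.4, 0.5]
wound_size = [1.0, 1.4, 2.1, 2.4, 3.1]

r_squared = pearson_r(epf_fraction, wound_size) ** 2
print(f"R^2 = {r_squared:.3f}")
```

An R² near 1 corresponds to the direct correlation described for fascia EPFs, whereas a value near 0 corresponds to the absence of correlation described for dermal EPFs.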

Long-term tracing of fascia EPFs in wounds showed that they recede 
ten weeks after injury (Extended Data Fig. 3c). This desertion from 
mature scars occurred through an apoptosis-independent mechanism, 
indicated by a low rate of cell death (<5%) across earlier time points
(Extended Data Fig. 3d, e). 

We next sought to place fascia EPFs in the framework of known lineage
markers used to define different populations of wound fibroblasts
such as CD24, CD34, DPP4, DLK1 and LY6A. All markers were prominent
in fascia EPFs and were surprisingly downregulated upon entering
the wound in our graft experiments (Extended Data Fig. 4). Flow cytometry
confirmed the higher expression of DPP4, ITGB1, LY6A and PDGFRα
in fascia than in dermal fibroblasts (Extended Data Fig. 5a-c). Sorted
fascia EPFs also revealed low cellular heterogeneity, with the population
predominantly comprising LY6G+PDGFRα+ (87.0%) and DPP4+ITGB1+

Fig. 3 | Fascia matrix steers into wounds. a, b, SHG (a) and SEM (b) images of
fascia (left) and dermis (right) showing matrix arrangements. Representative
images of 3 biological replicates. c, Fractal dimension and lacunarity values
from SHG images. n = 6 (dermis) and 3 (fascia) images analysed from 3
biological replicates. Unpaired two-tailed t-test, 95% CI. d, Time-lapse images
of fascia in culture at time 0 (left) and 30 h (right). Representative video from at
least three independent experiments. Autof., autofluorescence. e, Contraction
rate from the SHG and autofluorescence. Data derived from Supplementary
Video 4. f, Fascia matrix labelling with AF647 NHS ester in chimeric grafts.
TdTomato and GFP reporter line biopsies were used for the chimeric grafts as
before. Only the matrix of the GFP-positive fascia compartment was labelled
before transplantation. g, Images at 7 dpw showing fascia matrix covering the
wound. Representative image of at least three biological replicates. h, In situ
fascia matrix-tracing experiment using FITC NHS ester. Subcutaneous
injections of FITC NHS ester were performed 4 and 2 days before wounding in
the back-skin of wild-type mice to label the fascia matrix. i, Uninjured controls
showing specificity of the labelling in fascia. Representative sample of at least
three biological replicates. j, Images from 7 dpw showing fascia matrix
covering the wound area. Representative samples of at least three biological
replicates. Arrowheads indicate the original injury. Broken lines delimit dermis
(g and j) or PC (i). Scale bars, 30 µm (a, b), 500 µm (d, g, i, j) and 100 µm (g, j
magnified insets).


(72.8%) cells (Extended Data Fig. 5d). This broad marker convergence 
identifies fascia EPFs as the major source of wound fibroblasts. 


Provisional matrix emerges from fascia matrix 


We then looked at the fascia matrix itself. Second harmonic generation
(SHG) signal and scanning electron micrographs (SEM) revealed
profuse coiled collagen fibrils in the fascia, indicative of a relaxed and
immature matrix (Fig. 3a, b). Fractal measurements of the fibre alignments
showed a more condensed matrix configuration in fascia than in
the stretched and woven dermal matrix (Fig. 3c).
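
The fractal measurements were made with the FracLac plug-in for ImageJ (see Methods). To illustrate the underlying idea only, the sketch below estimates a box-counting dimension from a small binary image. It is a simplified stand-in under stated assumptions, not the FracLac algorithm or its settings; the function names, box sizes and test images are all hypothetical.

```python
import math


def box_count(image, size):
    """Count size x size boxes that contain at least one foreground pixel."""
    rows, cols = len(image), len(image[0])
    count = 0
    for r0 in range(0, rows, size):
        for c0 in range(0, cols, size):
            if any(
                image[r][c]
                for r in range(r0, min(r0 + size, rows))
                for c in range(c0, min(c0 + size, cols))
            ):
                count += 1
    return count


def box_counting_dimension(image, sizes=(1, 2, 4)):
    """Least-squares slope of log N(s) against log(1/s)."""
    counts = [box_count(image, s) for s in sizes]
    if min(counts) == 0:
        raise ValueError("image has no foreground pixels")
    xs = [math.log(1.0 / s) for s in sizes]
    ys = [math.log(n) for n in counts]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den


# Toy binary images: a filled 8 x 8 square and a single horizontal line.
filled = [[1] * 8 for _ in range(8)]
line = [[1] * 8] + [[0] * 8 for _ in range(7)]

print(box_counting_dimension(filled), box_counting_dimension(line))
```

A space-filling pattern gives a dimension close to 2 and a thin line gives a dimension close to 1, so intermediate values summarize how densely a fibre network fills the image plane.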

Given the immaturity of the fascia matrix, we checked whether it
could work as a repository for scar tissue. We developed an incubation
chamber that enabled live imaging of fascia biopsies over several days
(Methods). Recording of SHG signals showed steering of the matrix
at a rate of 11.4 µm h−1 (Fig. 3d, e, Supplementary Video 4). Assuming
a similar rate in vivo, the fascia matrix could move about 2 mm in 7
days, accounting for the dynamics of provisional matrix deposition
in mammals.
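
The estimate of about 2 mm in 7 days is a direct extrapolation of the rate measured in culture. A short worked check, assuming (as the text does) that the rate stays constant over the whole period:

```python
RATE_UM_PER_H = 11.4  # matrix steering rate measured in culture, in µm per hour


def projected_distance_mm(rate_um_per_h, days):
    """Distance covered at a constant rate, converted from µm to mm."""
    hours = days * 24
    return rate_um_per_h * hours / 1000.0


distance_7_days = projected_distance_mm(RATE_UM_PER_H, 7)
print(f"{distance_7_days:.2f} mm in 7 days")
```

The result, a little over 1.9 mm, is on the scale of the 2-mm wounds made in the chimeric grafts.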

To test whether fascia matrix steers into wounds in vivo, we developed
a method to trace the fascia matrix in our chimeric grafts using
NHS esters (Fig. 3f, Methods). Streams of traced matrix from the fascia
extended upwards and plugged the open wounds from 7 dpw (Fig. 3g,
Extended Data Fig. 6a, b). Fascia-derived matrix covered 74.78 ± 12.94%
of total collagen content in the wound (Extended Data Fig. 6c). Our data
indicated that individual fascia matrix fibres were not being pulled; 
instead, pliable matrix was extended upwards to mould the wound. 
Advanced wound stages showed a decline in label in specific regions 
of the wound, suggesting an active remodelling process of the fascia- 
derived matrix (Extended Data Fig. 6d-f, Supplementary Video 5). 

We then tested whether dermal matrix could be steered by double 
labelling our chimeric grafts. Only the fascia matrix plugged deep injury 
wounds (Extended Data Fig. 6g-j), whereas dermal matrix remained 
immobile in deep and superficial injuries; the superficial injuries healed 
via de novo matrix deposition (Extended Data Fig. 6k, l). 

To provide further evidence showing that fascia matrix migrates to 
open wounds, we labelled the fascia matrix in situ before injury (Fig. 3h, 
i, Methods). Labelled matrix made up most of the wound provisional 
matrix (Fig. 3j), which underwent remodelling during the first two 
weeks following injury (Extended Data Fig. 7a, b). Fractal measurements 
showed that fascia fibre interfaces expanded by 3 dpw, changing from 
a parallel sheet arrangement to a highly porous plug. This expansion 
was followed by contraction into thicker and more complex mature 
scar matrix architecture (Extended Data Fig. 7c-e). Surprisingly, traced 
matrix was also present in the eschar. Activated platelets infiltrated 
and clustered within fascia fibres before eschar formation (Extended 
Data Fig. 7f), indicating that the coagulation cascade occurs in parallel 
with fascia matrix steering. 


EPFs steer fascia matrix into wounds 


To test whether matrix steering from fascia is caused by EPFs, we
blocked fascia by implanting an impermeable dual surface expanded
polytetrafluoroethylene (ePTFE) membrane between the fascia and
the PC in wounds of En1Cre;R26VT2/GK3 mice (Fig. 4a, Methods). Surprisingly,
wounds with implants remained completely open whereas sham
controls closed within 21 days (Fig. 4b). After two months, EPFs trailed
from the wound margins and under the membrane without generating
scars (Fig. 4c, Extended Data Fig. 8a). Implants produced a transient
inflammation that resolved even when the wound remained open.
Two-month-old wounds with implants showed normal levels of leukocytes
and of pro- and anti-inflammatory interleukins (Extended Data Fig. 8b-i),
consistent with the clinical use of ePTFE as immunologically inert
membranes. The coagulation cascade was also unaltered at the border
between the dermis and the membrane (Extended Data Fig. 8j, k).
These results indicate that the lack of scarring with ePTFE membranes
does not reflect chronic inflammation or poor clotting, but rather a
blockade of fascia steering mediated by the fascia fibroblasts. These
findings further support the notion that scar tissue is mostly derived
from the fascia, since dermal EPFs or dermal matrix are unable to repair
wounds in the absence of fascia movements.

We next investigated whether mechanical separation between dermis
and fascia alone, without barrier implants, would affect matrix steering
and scar formation. To address this question, we performed full
excisional wounds in wild-type mice and physically released the fascia
below the PC around the wound (Fig. 4d). Wound closure from released-
fascia wounds was significantly delayed, and wounds remained open
early on, similarly to those documented following membrane implantations
(Fig. 4e, f).

To definitively link fascia EPFs to matrix steering, we genetically 
ablated fascia EPFs using two separate strategies. First, we used a 

Fig. 4 | Fascia EPFs steer scar primordium into wounds. a, 8-mm diameter
ePTFE membranes were implanted between the skin and fascia in fresh 6-mm
diameter wounds to block fascia steering. b, ePTFE-implanted or sham wound
closure (left) determined from photographs (right) at specified time points.
Individual values, n = 3 biological replicates. Unpaired two-tailed t-test, 95% CI.
c, Sham (left) or ePTFE-implanted (right) wounds at 63 dpw. Trichrome staining
(top) and collagen immunolabelling (middle). Magnifications (1, 2 (bottom))
show multiclonal dermal EPFs. Representative images from 3 biological
replicates. d, Fascia tissue around fresh 5-mm diameter wounds was
mechanically separated from the skin above using a metal paddle. e, Fascia-
released or control wound closure (left) determined from photographs (right)
at specified time points. Individual values, n = 8 (0-3 dpw), 6 (5 dpw), 4 (7 dpw)
and 2 (10 dpw) images from 8 biological replicates. Unpaired two-tailed t-test,
95% CI. f, Trichrome-stained wounds at 3, 5 and 7 dpw from control or fascia-


transgenic line that expresses the diphtheria toxin receptor (DTR) in a
Cre-dependent manner (R26iDTR), enabling us to deplete cells expressing
Cre recombinase upon exposure to diphtheria toxin (DT). We thus generated
Cre-expressing adeno-associated viral particles (AAV6-Cre) and
injected them into the fascia of R26iDTR pups underneath freshly made full
excisional wounds (Fig. 4g). Scar sizes from AAV6-Cre transduced mice
treated with DT were significantly smaller than those from controls (Fig. 4h, i).

Second, we used En1Cre;R26iDTR double transgenic mice, in which
DTR expression is restricted to EPFs, making them susceptible to

released wounds. Representative images from 8 biological replicates. g, Fascia
cell depletion in R26iDTR neonates. h, Fluorescence image (left) and trichrome-
stained wounds at 7 dpw in GFP- (middle) or Cre-transduced (right) fascia.
Representative images from 3 biological replicates. i, Scar-length
measurements. Data are mean ± s.e.m.; n = 4 and 8 sections from 3 biological
replicates. Unpaired two-tailed t-test, 95% CI. j, Depletion of fascia EPFs in
chimeric grafts with fascia matrix labelling. k, Immunolabelling for collagens
and fascia matrix in control (left) or DT-treated (right) grafts. Representative
images from 3 biological replicates. l, Matrix-labelling coverage. Data are
mean ± s.e.m.; n = 6 sections from 3 biological replicates. Unpaired two-tailed
t-test, 95% CI. Dashed lines delimit the implant. Dotted lines delimit the wound.
Arrowheads indicate the original injury. Arrow indicates remaining labelled
fascia matrix in DT-treated grafts. Scale bars, 50 µm (c (1 and 2)), 200 µm (h)
and 500 µm (c (main image), f, h, k).


DT-mediated ablation. We corroborated the ablation of fascia EPFs in
cultured biopsies 6 days after acute exposure to DT for 1 h. An effective
dose of DT prevented the normal increase in collagen fibre density
observed in control samples and decreased the cell density 2.5-fold
(Extended Data Fig. 9a-c). Live imaging showed absence of any matrix
steering after DT exposure (Extended Data Fig. 9d, e, Supplementary
Video 6), confirming that fascia EPFs are essential for matrix steering.

Next, we created chimeric grafts using dermis from wild-type mice
and fascia from En1Cre;R26iDTR mice. We ablated fascia EPFs using DT

Fig. 5 | Marker signature in keloids and fascia. a, b, Macroscopic images and
trichrome staining of human back (a) and abdominal (b) skin showing fascia
layers embedded in subcutaneous tissue. Arrows indicate the fascia.
Representative images from 4 independent samples. c, Immunolabelling for
DPP4, CD44 and FAP, and NOV and ACTA2 in fascia, dermis and keloids of
human back skin. d, Relative fluorescence intensity (RFI) in c, normalized to the
dermis. Data are mean ± s.e.m.; n = 4 images analysed from 4 biological


and labelled the matrix before transplantation (Fig. 4j). Ablation of
fascia EPFs prevented matrix steering into the wound (Fig. 4k, l);
instead, labelled matrix remained in the fascia layer below. Together,
our data demonstrate that fascia-resident EPFs actively steer matrix
to seal open wounds.

To check whether fibroblast proliferation preceded and was needed 
for matrix steering, we analysed the proliferation rate in our matrix- 
tracing experiments. Expansion of the fascia gel beneath the wound 
occurred during the first days after injury, whereas cell proliferation 
peaked after one week (Extended Data Fig. 10a-c), indicating that proliferation
is not required for matrix steering. Furthermore, treatment
with a proliferation inhibitor had no effect on fascia matrix steering 
in vitro (Extended Data Fig. 9f-k, Supplementary Video 7). Our results 
demonstrate that fascia matrix works as an expanding sealant that 
quickly clogs deep wounds independently of cell proliferation. 


Fascia and keloid share marker signatures 


Similar to mouse fascia matrix, human keloids present poorly structured
and densely packed collagen fibres. This motivated us to investigate
the presence of fascia fibroblasts in keloid tissue. We screened
for markers present in keloids and compared them with healthy dermis
and the connective tissue in the subcutaneous space (fascia) of human
skin across multiple anatomic locations (Fig. 5a, b). FAP and DPP4 were
highly expressed in both fascia and keloids, with low expression in dermis.
The fascia-restricted protein NOV was also prominently expressed
in both human and mouse fascia, as well as in human keloids and mouse
scars (Fig. 5c-g). This preservation of fascia markers across mouse
and human fascia and keloid scars suggests a common fascia origin
for most human cutaneous scars.


Discussion 


Current models of wound healing propose that dermal fibroblasts
migrate into wounds and locally deposit matrix de novo onto the
granulation tissue provided by the coagulation cascade. This provisional
matrix is then remodelled into a mature scar. On the basis of our
findings in this study, we propose a revised model (Fig. 6) in which, in
deep injuries, fascia fibroblasts pilot their local composite matrix into
wounds, where it forms the provisional matrix in coordination with the
coagulation cascade. Thus, instead of de novo matrix deposition by dermal
fibroblasts, a ‘scar primordium’ is steered by the fascia fibroblasts.
Fascia therefore serves as an external repository, or externum repono, of
scar-forming provisional matrix, which represents an efficient mechanism
to quickly seal large open wounds. Previous studies have shown
that the matrix undergoes movement during early development and
organ morphogenesis. To our knowledge, the extent and magnitude
of matrix movements that we document here have not been observed
during injury or regenerative settings. Cultured dermal fibroblasts have
been shown to pull and reorient individual collagen or fibronectin fibres
locally in cultured plates and in 3D in vitro assays. However, our findings
reveal highly dynamic and large-scale movements of composite
tissue matrix during injury that are mediated exclusively by specialized
fibroblasts of the fascia.

Our findings on the contribution of fascia to large scars and its blockage
leading to chronic open wounds indicate that the spectrum of poor

Fig. 6 | Revised wound healing model. Superficial injuries heal by the classical
fibroblast migration and de novo matrix deposition process. In response to a
deep injury, fascia fibroblasts steer their surrounding tissue into wounds.
Fascia-derived macrophages, endothelial cells and peripheral nerves rapidly clog
the open wound. In coordination with the platelet response, the fascia matrix
serves as a provisional matrix that undergoes remodelling until it forms a
mature scar.

and excessive scarring in the skin, such as in diabetic and ulcerative
wounds, as well as in hypertrophic and particularly keloid scars, might
all be attributed to fascia. Indeed, the subcutaneous fascia varies widely
according to species, sex, age and anatomic skin location. In some
mammals, the superficial fascia is loose, whereas in scar-prone species
such as human, dog and horse, the superficial fascia is thicker. Human
fascia further varies in thickness in different regions of the body.
For example, the lower chest, back, thigh and arm have much thicker
and multi-layered membranous sheets, and it is these anatomic sites
that are prone to form hypertrophic and keloid scars. Understanding
the topographic anatomy of the fascia layer may help explain scar
phenotypes and severities, including the occurrences of hypertrophic
and keloid scars.


Methods


Mice and genotyping 

All mouse strains (C57BL/6J, En1Cre, R26VT2/GK3, R26mTmG, R26iDTR, Rag2−/− and
Fox Chase SCID) were either obtained from Jackson Laboratories or Charles
River, or generated at the Stanford University Research Animal Facility
as described previously. Animals were housed at the Helmholtz Center
Animal Facility. Cages were maintained at constant temperature and
humidity with a 12-h light cycle. Animals were supplied with food and
water ad libitum. All animal experiments were reviewed and approved
by the Government of Upper Bavaria, registered under the projects
55.2-1-54-2532-16-61 and 55.2-2532-02-19-23 and conducted under
strict governmental and international guidelines. This study is compliant
with all relevant ethical regulations regarding animal research.
Unless specified, 8- to 10-week-old adult mice were used for the animal
experiments. Both male and female mice were used. Cre-positive
(Cre+) animals from double-transgenic reporter mice were identified
by detection of relevant fluorescence in the dorsal dermis. Genotyping
was performed to distinguish mouse lines containing a 200-base pair
(bp) cre fragment (cre+) from the wild-type (cre−). Genomic DNA from
the ear-clips was extracted using QuickExtract DNA extraction solution
(Epicentre) following the manufacturer's guidelines. DNA extract
(1 µl) was added to each 24 µl PCR. The reaction mixture was set up using
Taq PCR core kit (Qiagen) containing 1× coral buffer, 10 mM dNTPs,
0.625 units Taq polymerase, 0.5 µM forward primer ‘Cre genotype 4F’
5’-ATTGCTGTCACTTGGTCGTGGC-3’ (Sigma) and 0.5 µM reverse primer
‘Cre genotype 4R’ 5’-GGAAAATGCTTCTGTCCGTTTGC-3’ (Sigma). PCRs
were performed with initial denaturation for 10 min at 94 °C, amplification
for 30 cycles (denaturation for 30 s at 94 °C, hybridization for 30 s
at 56 °C, and elongation for 30 s at 72 °C) and final elongation for 8 min
at 72 °C, and then cooled to 4 °C. In every experiment, negative controls
(non-template and extraction) and positive controls were included. The
reactions were carried out in an Eppendorf master cycler. Reactions
were analysed by gel electrophoresis.
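
The cycling programme described above can be written down as a small data structure, which makes the programmed run time easy to check. This is a sketch for bookkeeping only: the class and function names are hypothetical, and ramp times between temperatures and the final 4 °C hold are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: float
    seconds: int


# Steps and durations as stated in the genotyping protocol.
INITIAL_DENATURATION = Step("initial denaturation", 94, 10 * 60)
CYCLE = (
    Step("denaturation", 94, 30),
    Step("hybridization", 56, 30),
    Step("elongation", 72, 30),
)
N_CYCLES = 30
FINAL_ELONGATION = Step("final elongation", 72, 8 * 60)


def total_program_seconds():
    """Programmed time, excluding ramping and the final 4 °C hold."""
    cycle_seconds = sum(step.seconds for step in CYCLE)
    return (
        INITIAL_DENATURATION.seconds
        + N_CYCLES * cycle_seconds
        + FINAL_ELONGATION.seconds
    )


print(f"{total_program_seconds() / 60:.0f} min of programmed steps")
```

The 30 amplification cycles account for 45 of the 63 programmed minutes.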


Human skin samples 

Fresh human skin and scar biopsies, from various anatomic locations, 
were collected from donors between 18 and 65 years of age, through the
Section of Plastic and Aesthetic Surgery, Red Cross Hospital Munich (reference
number 2018-157), and by the Department of Dermatology and Allergology,
Klinikum rechts der Isar, Technical University Munich (reference
number 85/18S). Informed consent was obtained from all subjects before
skin biopsies. Upon collection, these samples were directly processed for 
tissue culture or fixed with PFA and then processed for cryosection or 
paraffin section followed by histological or immunofluorescent analyses. 


Fascia in vitro culture 

Two in vitro systems were used. To visualize the changes in matrix
architecture in real time, 2-mm-diameter biopsies were excised from
P0 C57BL/6J neonates and processed for live imaging (SCAD assay,
Patent Application no. PLA17A13). To determine the effectiveness of
the DT treatment, muscle and fascia were manually separated from the
rest of the skin in the chimeric graft experiments and incubated with
DT at different concentrations for 1 h at ambient temperature. Next,
samples were washed with PBS and incubated in DMEM/F12 (Thermo
Fisher) supplemented with 10% serum (Thermo Fisher), 1% penicillin-
streptomycin (Thermo Fisher), 1% GlutaMAX (Thermo Fisher) and 1%
non-essential amino acids solution (Thermo Fisher) in a 37 °C, 5% CO2
incubator. Medium was routinely exchanged every other day. Samples
were fixed at day 6 of culture with 2% paraformaldehyde and processed
for histology.


Histology 
Tissue samples were fixed overnight with 2% paraformaldehyde in PBS
at 4 °C. Samples were rinsed three times with PBS, embedded in optimal
cutting temperature compound (OCT, Sakura Finetek) and flash-frozen on dry
ice. Six-micrometre sections were made in a Cryostar NX70 cryostat
(Thermo Fisher). Masson’s trichrome staining was performed with a
Sigma-Aldrich trichrome stain kit, according to the manufacturer's
guidelines. For immunolabelling, sections were air-dried for 5 min
and fixed with −20 °C-chilled acetone for 20 min. Sections were rinsed
three times with PBS and blocked for 1 h at room temperature with 10%
serum in PBS. Then, the sections were incubated with primary antibody
in blocking solution for 3 h at ambient temperature. Sections were then
rinsed three times with PBS and incubated with secondary antibody
in blocking solution for 60 min at ambient temperature. Finally, sections
were rinsed three times in PBS and mounted with fluorescent
mounting media with 4′,6-diamidino-2-phenylindole (DAPI). Primary
antibodies used: goat-anti-ACTA2(αSMA) (1:50, Abcam), rabbit-anti-
TUBB3 (1:100, Abcam), rat-anti-THY1(CD90) (1:100, Abcam), rat-anti-
CD24 (1:50, BD Biosciences), rabbit-anti-DPP4(CD26) (1:150, Abcam),
rabbit-anti-PECAM1 (1:10, Abcam), rat-anti-CD34 (1:100, Abcam),
rabbit-anti-collagen I (1:150, Rockland), rabbit-anti-collagen III (1:150,
Abcam), rabbit-anti-collagen VI (1:150, Abcam), rabbit-anti-DLK1 (1:200,
Abcam), rat-anti-ERTR7 (1:200, Abcam), rat-anti-F4/80 (1:400, Abcam),
rabbit-anti-LYVE1 (1:100, Abcam), rat-anti-MOMA2 (1:100, Abcam),
goat-anti-PDGFRα (1:50, R&D Systems), rat-anti-LY6A(Sca1) (1:150,
Biolegend), rat-anti-CD44 (1:100, Abcam), rabbit-anti-NOV/CCN3 (1:20,
Elabscience), sheep-anti-FAP (1:100, R&D Systems), rat-anti-IL12 (1:50,
Biolegend), rat-anti-IL4 (1:50, Biolegend), rat-anti-CD19 (1:20, BD
Biosciences), hamster-anti-CD3e (1:100, Biolegend), rat-anti-NCR1 (1:20,
Biolegend) and rat-anti-LY6G (1:100, Abcam). PacificBlue-, Alexa Fluor
488-, Alexa Fluor 568- or Alexa Fluor 647-conjugated antibodies (1:500,
Life Technologies) against suitable species were used as secondary
antibodies.


Microscopy 

Histological sections were imaged using a ZEISS AxioImager.Z2m
(Carl Zeiss). For whole-mount 3D imaging of wounds, fixed samples
were embedded in 35-mm glass bottom dishes (Ibidi) with low-melting
point agarose (Biozym) and left to solidify for 30 min. Imaging was performed
using a Leica SP8 multi-photon microscope (Leica, Germany).
For live imaging of fascia cultures, samples were embedded as described
above. Care was taken to mount the samples with the fascia facing
up towards the objective. Imaging medium (DMEM/F-12; SiR-DNA
1:1,000) was then added. Time-lapse imaging was performed over 20 h
under the multi-photon microscope. A modified incubation system,
with heating and gas control (Ibidi, catalogue nos. 10915 and 11922), was
used to guarantee physiologic and stable conditions during imaging.
Temperature control was set to 35 °C with 5% CO2-supplemented air.
Second harmonic generation signal and green auto-fluorescence as
a reference were recorded every hour. 3D and 4D data were processed
with Imaris 9.1.0 (Bitplane) and ImageJ (1.52i). Contrast and brightness
were adjusted for better visibility.


Image analysis 

Histological images were analysed using ImageJ. For quantification of
labelled cells in our fate mapping experiments, we manually defined
the wound, surrounding dermis, and adjacent fascia areas. We defined
the wound as the area flanked by the nearest hair follicles on both sides,
extending from the base of the epidermis down to the level of the hair
follicle bulges. Surrounding dermis area was defined as the 200 µm
immediately adjacent to the wound on both sides. Fascia area was
defined as the tissue immediately below the wound. The number of
labelled cells in each area was determined by quantifying the particles
that were double-positive for DAPI and for the desired label (for
example, DiI or GFP) channels. The coverage of the labelled matrix
in the wound area was determined by quantifying the area that was
double-positive for the labelled matrix and the collagen I-, III- and VI-
staining signal. Cell density of En1Cre;R26iDTR cultures treated with DT was
quantified by dividing the total cells (DAPI) by the matrix area (collagen
I, III and VI). Collagen density was calculated as the collagen area coverage
of the entire section area. Matrix movements in live-imaged cultures
were determined by tracking the distance between the two furthest points
along the sample in both the SHG and auto-fluorescence channels.
Length measurements were normalized to the original length at time 0.
Wound size was normalized for each time point using the original area
at day 0. Scar length was quantified from randomly selected sections
taken from the middle of the scar using the two flanking hair follicles
as a reference. RFI was calculated by measuring the mean grey value
and normalizing to the dermis images. Fractal analysis was performed
using the ImageJ plug-in ‘FracLac’ (ref. 29; FracLac2015Sep090313a9390)
using the same settings and preprocessing as previously described.
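
Two of the quantities defined above reduce to simple ratios. The sketch below illustrates normalization of a length series to its value at time 0, and matrix coverage as the double-positive area relative to the collagen-positive area. The choice of the collagen-positive area as denominator is an assumption made here for illustration, as are the function names and the toy masks; the study performed these measurements in ImageJ.

```python
def normalize_to_start(values):
    """Express each measurement as a fraction of the first (time 0) value."""
    start = values[0]
    if start <= 0:
        raise ValueError("the time 0 value must be positive")
    return [v / start for v in values]


def double_positive_fraction(label_mask, collagen_mask):
    """Fraction of collagen-positive pixels that are also label-positive.

    Both masks are equal-sized 2D lists of 0/1 values (thresholded images).
    """
    collagen_pixels = 0
    double_positive = 0
    for label_row, collagen_row in zip(label_mask, collagen_mask):
        for label, collagen in zip(label_row, collagen_row):
            if collagen:
                collagen_pixels += 1
                if label:
                    double_positive += 1
    if collagen_pixels == 0:
        raise ValueError("no collagen-positive pixels in the mask")
    return double_positive / collagen_pixels


# Toy inputs for illustration only (not data from the study).
lengths_um = [200.0, 150.0, 100.0]
label = [[1, 1, 0], [0, 0, 0]]
collagen = [[1, 1, 1], [1, 0, 0]]

print(normalize_to_start(lengths_um))
print(double_positive_fraction(label, collagen))
```

Normalizing each sample to its own starting value allows samples of different initial size to be compared on a common scale.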


DiI labelling of fascia in animals

Two 5-mm-diameter full-thickness excisional wounds were created
on the back of 8- to 10-week-old C57BL/6J mice with a biopsy punch.
Lipophilic Vybrant DiI dye (Life Technologies, V22885) (10-20 µl) was
injected into the exposed fascia directly above the dorsal muscles.
Wounded tissue was harvested 14 dpw and processed for histology
and imaging by fluorescence microscopy.


Chimeric skin transplantation 

Full-thickness 6-mm-diameter biopsies were collected from the back- 
skin of either R26"""°, R26", EnI“°-R26""™®, EnI°:R26°"* or C57BL6/J 
adult mice. Using the PC muscle layer as an anatomical reference, the 
fascia together with the muscle layer were carefully separated from 
the dermis and epidermis using Dumont no. 5 forceps (Fine Science 
Tools) and a 26G needle under the fluorescent stereomicroscope 
(M205 FA, Leica). EPFs from fascia + muscle samples of EnI°’;R26°" 
mice were ablated by incubation with 20 pg mI" of diphtheria toxin 
(Sigma-Aldrich, D0564) or only DMEM/F12 as vehicle for 1h at ambient 
temperature followed by 3 washing steps with PBS. At this point, the 
matrix samples were labelled by incubation with 100 µM Alexa Fluor
NHS Ester (Life Technologies, A20006) or Pacific Blue succinimidyl
ester (Thermo Fisher, P10163) in PBS for 1 h at ambient temperature
followed by 3 washing steps with PBS. Chimeras were made by placing 
the epidermis + dermis portion of a mouse strain on top of the mus- 
cle + fascia of another strain and left to rest for 20 min at 4 °C inside a 
35-mm culture dish with 2 ml of DMEM/F12. Special attention was paid 
to preserving the original order of the different layers (top to bottom:
epidermis > dermis > muscle > fascia). Then, a 2-mm ‘deep’ full-thickness wound
was excised from the chimeric graft using a biopsy punch in the middle
of the biopsy. To create ‘superficial’ wounds, the 2-mm excision was 
done only in the epidermis + dermis half before reconstitution with 
the bottom part. ‘Wounded’ chimeric grafts were then transplanted 
into freshly-made 4-mm-diameter full-thickness excisional wounds 
in the back of either RAG2−/− or Fox Chase SCID immunodeficient 8- to
10-week-old mice. Precautions were taken to clean out the host blood 
from the fresh wound before the transplant and to leave the graft to 
dry for at least 20 min before ending the anaesthesia, to increase the 
transplantation success. To prevent mice from removing the graft, a 
transparent dressing (Tegaderm, 3M) was placed on top of the grafts.


In situ matrix tracing and EdU pulses 

Eight- to ten-week-old C57BL6/J mice received subcutaneous 20-µl injections
of 10 mg ml−1 FITC NHS ester in physiological saline with 0.1 M sodium
bicarbonate pH 9 (46409, Life Technologies) four and two days before
wounding. At 2, 6 or 13 dpw, mice received 200 µl intraperitoneal injections
of 1 mg ml−1 EdU in PBS. Samples were collected 24 h after the EdU pulse
and processed for cryosection and imaging by fluorescence microscopy.


Flow cytometry


Fascia and dermis were physically separated from the back-skin of
C57BL6/J or En1Cre;R26mTmG mice under the fluorescence
stereomicroscope as before. Harvested tissue was minced with surgical
scissors and digested with an enzymatic cocktail containing 1 mg ml−1
collagenase IV, 0.5 mg ml−1 hyaluronidase and 25 U ml−1 DNase (Sigma-
Aldrich) at 37 °C for 30 min. The resulting single-cell suspension was
filtered and incubated with conjugated/unconjugated primary antibodies
(dilution 1:200) at 4 °C for 30 min, followed by an incubation with a
suitable secondary antibody when needed at 4 °C for 30 min. Cells were
washed and stained with Sytox blue dye (dilution 1:1,000; Life Technologies,
S34857) for dead cell exclusion. Cells were subjected to flow cytometric
analysis using a FACSAria III (BD Bioscience). Fibroblasts were
gated as the lineage-negative (Lin−) fraction not expressing the PTPRC,
PECAM1, LY76, LYVE-1, or EPCAM markers. Primary antibodies used:
anti-DLK1 (Abcam), anti-CD9 (Santa Cruz), anti-NGFR (Miltenyi), anti- 
F4/80 (Abcam), AlexaFluor790-anti-NG2 (Santa Cruz), FITC-anti-DPP4 
(eBioscience), PerCP-eFluor710-anti-ITGB1 (eBioscience), anti-CD34 
(Abcam), PerCP-Cy5.5-anti-CD24 (eBioscience), APC-Fire750-anti-CD34 
(Biolegend), APC-anti-ITGA7 (R&D systems), PerCP-Cy5.5-anti-LY6A 
(eBioscience), PE-Vio770-anti-PDGFRa (Miltenyi), PerCP-Vio700-anti- 
CD146 (Miltenyi), APC-anti-PECAM1 (eBioscience), eFluor660-anti-
LYVE1 (Thermo Fisher), APC-LY76(TER119), APC-anti-EPCAM(CD326)
and APC-anti-PTPRC. Secondary antibodies used: Alexa Fluor 488 goat
anti-rabbit (Life Technologies) and Alexa Fluor 568 goat anti-rat (Life
Technologies).


Scanning electron microscopy 

Skin biopsies of adult C57BL6/J mice were collected, and the fascia
was manually separated as before. Samples were then fixed overnight
with paraformaldehyde and glutaraldehyde, 3% each, in 0.1% sodium
cacodylate buffer pH 7.4 (Electron Microscopy Sciences). Samples were
dehydrated in a graded ethanol series and dried by the critical-point method,
using CO2 as the transitional fluid (Polaron Critical Point Dryer CPC 
E3000; Quorum Technologies) and observed by scanning electron 
microscopy (JSM 6300F; JEOL). 


ePTFE membrane implants 

These membranes are routinely used in the clinic to circumvent 
post-operative adhesions after laparoscopic ventral incisional hernia 
repairs. Two 6-mm-diameter full-thickness excisional wounds were
created with a biopsy punch on the back of 8-week-old En1Cre;R26mTmG or
C57BL6/J mice. Sterile 8-mm-diameter ePTFE impermeable membranes 
(Dualmesh, GORE) were implanted between the surrounding skin and 
the fascia underneath, to cover the open wound on the right side. For 
this, the surrounding skin was loosened using Dumont no. 5 forceps and
a spatula (10090-13, Fine Science Tools). The dual-surface membrane was
implanted with the attaching face facing out, so as to promote dermal cell
attachment, while the smooth surface was in direct contact with the 
fascia. The left sham control wound underwent the same procedure 
without implanting any membrane. Each wound was photographed at 
indicated time points, and wound areas were measured using ImageJ.
Wound sizes at any given time point after wounding were expressed as
a percentage of initial (day 0) wound area. At 7 or 63 dpw, samples were
collected and processed for histology. 


Released fascia injury in adult mice 

Two 5-mm-diameter full-thickness excisional wounds were created 
with a biopsy punch on the back of 8-week-old male C57BL6/J mice.
The skin around the wound on the left side was separated from the
underlying skeletal muscle using sterilized gold-plated 3 × 5 mm
Genepaddles (Harvard Apparatus, 45-0122) to release the fascia layer.
The right wound served as a control. Each wound was digitally
photographed at indicated time points, and wound areas were measured
using Photoshop (Adobe Systems). Wound sizes at any given time point
after wounding were expressed as a percentage of initial (day 0) wound
area. The harvested tissue at the indicated time points was processed
for cryosection and Masson’s trichrome staining for histology. 


Viral particle production 

AAV6 particles expressing GFP or Cre recombinase were produced by
transfecting the AAVpro 293T Cell Line (Takara Bio, 632273) with
pAAV-U6-sgRNA-CMV-GFP (Addgene, 85451)42 or pAAV-CRE Recombinase vector
(Takara Bio, 6654), pRC6 and pHelper plasmids procured from the AAVpro
Helper Free System (Takara Bio, 6651). Transfection was performed
with PEI transfection reagent and viruses were harvested 72 h later
and purified with an AAVpro purification kit (Takara Bio, 6666); the
titre was calculated using real-time PCR.


Fascia cell ablation with AAV6-Cre viral particles and DT
treatment in pups 

Two 3-mm-diameter full-thickness excisional wounds were created with
a biopsy punch on the back of postnatal day 11 (P11) R26iDTR mice. Twenty
microlitres of Cre-expressing AAV6-Cre or control AAV6-eGFP at viral
titre of 5 x 10" mI“ were injected subcutaneously at the area between 
the two wounds. DT solution at 1 ng µl−1 in PBS was injected intraperitoneally
into each mouse once per day for 7 days at a dosage of 5 ng g−1.
Tissue was collected seven days after wounding. 
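
The daily DT injection volume follows from the stock concentration and the weight-based dosage stated in this section. The sketch below assumes the dosage is per gram of body weight; the function name and the example body weight are hypothetical and are not taken from the study.

```python
def injection_volume_ul(body_weight_g, dose_ng_per_g=5.0, stock_ng_per_ul=1.0):
    """Volume of DT stock (in microlitres) that delivers the weight-based dose.

    The defaults mirror the text: 5 ng per g of body weight from a
    1 ng per microlitre stock. The body weight must be supplied by the user.
    """
    if body_weight_g <= 0 or stock_ng_per_ul <= 0:
        raise ValueError("body weight and stock concentration must be positive")
    total_dose_ng = dose_ng_per_g * body_weight_g
    return total_dose_ng / stock_ng_per_ul


# Hypothetical example: a 6 g pup needs 30 ng of DT, that is 30 microlitres of stock.
volume = injection_volume_ul(6.0)
```

With a 1 ng µl−1 stock the volume in microlitres is numerically five times the body weight in grams, which makes the calculation easy to check by hand.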


Statistics 

Statistical analyses were performed using GraphPad Prism software 
(v.6.0, GraphPad). Statistical tests and P values are specified in the figure
legends and in the corresponding plots. For simplicity, P values below
0.0001 were stated as equal to 0.0001. 
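
The reporting convention for very small P values can be expressed as a floor. This is an illustrative sketch only; the function name is an assumption, and the statistics in the study were computed in GraphPad Prism rather than with this code.

```python
def reported_p_value(p, floor=0.0001):
    """Return the P value as it would be stated: values below the floor are stated as the floor."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("a P value must lie between 0 and 1")
    return max(p, floor)
```

For example, a computed value of 0.00002 would be stated as 0.0001, whereas 0.03 would be stated unchanged.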


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Source Data for Figs. 1-5 and Extended Data Figs. 1-10 are provided with 
the paper. Additional information is available from the corresponding 
author on reasonable request. 
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Extended Data Fig. 1| Fate mapping of fascia cells with Dil. a, Dil labelling of 
fascia cells. b, Histology showing DiI+ cells in uninjured controls (left) and at
14 dpw (right). Representative images of five biological replicates.
c, Immunolabelling (top) and fractions (bottom) of DiI-positive cells
expressing mesenchymal/fibroblast markers ITGB1, ER-TR7, THY1 and
PDGFRα. Data are mean ± s.e.m.; n = 4 (5 in ITGB1) images analysed from 3
biological replicates. d, Immunolabelling (top) and fractions (bottom) of DiI-
positive monocytes/macrophages (MOMA-2), lymphatic cells (LYVE1),
endothelial cells (PECAM1) and nerve cells (TUBB3). Data are mean ± s.e.m.;
n = 5 images analysed from 3 biological replicates. Lines delimit PC. The dotted
line delimits the wound. Scale bars, 200 µm.
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Extended Data Fig. 2| Fascia EPFs traverse PC. a, Gating strategy for 
fibroblast analysis. b, Percentages of fibroblasts (Lin−) and lineage-positive
cells in fascia and dermis. Data are mean ± s.e.m.; n = 4 independent
experiments. Unpaired two-tailed t-test, 95% CI. c, Scatter plots of EPFs
(GFP+Lin−) and ENFs (tdTomato+Lin−) in fascia and dermis. Representative plots
of three independent experiments. d, EPF and ENF fractions in fascia and
dermis. Data are mean ± s.e.m.; n = 4 independent experiments. Two-way
ANOVA, multiple comparison Tukey’s test, 95% CI. e, Endothelial cell (PECAM1+),
lymphatic cell (LYVE1+), macrophage (F4/80+) and nerve cell (NGFR+) fractions
in fascia and dermis. Data are mean ± s.e.m.; n = 3 biological replicates. Two-way
ANOVA, multiple comparison Tukey’s test, 95% CI. f-k, Representative images
of 3D-rendered En1Cre;R26mTmG or En1Cre;R26VT2/GK3 back skin fascia from at least
three biological replicates. f, Lateral view (left) and cross-sections (right) of
adult fascia. g, Top view (ventral side up) of neonate back skin. h, Top side view
(top) and lateral cross-section (bottom) at the forelimb junction showing EPFs
traversing the PC. i, Top view at a muscle breach showing EPFs in both locations.
j, Top view at a muscle opening where nerves pass through and polyclones of
EPFs reside. k, Top view (top, epidermis side up) and lateral cross-section
(bottom) of an adult superficial wound (3 dpw). Broken lines delimit PC.
Dotted lines delimit the epidermis. Scale bars, 1,500 µm (g), 100 µm (f, i, j) and
500 µm (k); v, vessels; nb, nerve bundles.
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Extended Data Fig. 3| Fascia EPFs maintain position in steady conditions 
and recede from wounds over time. a, Dermal versus fascia EPFs chimeras in 
uninjured conditions. b, Fascia (left) or dermal (right) EPFs-traced chimeras. 
Representative images of 3 biological replicates. c, Scars at 70 dpw from deep 
injuries of fascia EPF-traced chimeras immunolabelled for DPP4. 
Representative images of 3 biological replicates. d, Cleaved CASP3 expression 
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(top) or superficial (bottom) injuries at 14 dpw. e, Fractions of fascia or dermal 
EPFs in the wound, dermis or fascia control regions positive for cleaved CASP3.
Data are mean ± s.e.m.; n = 6 and 5 (fascia and dermal EPF, respectively) images
analysed from 5 biological replicates. Lines delimit the border between fascia
and dermis. Dotted lines delimit the wound or scar. Scale bars, 200 µm.
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Extended Data Fig. 4 | Fascia EPFs express wound fibroblast markers. exception of DPP4 and all markers in fascia EPFs with the exception of DLK1, 
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b-f, Representative immunolabelling for the fibroblast markers DPP4(b),DLK1 fascia EPFs) images analysed from 4 biological replicates. One-way ANOVA, 
(c), CD24 (d), CD34 (e) and LY6A (f) from 4 biological replicates. g, Areas multiple comparison Tukey’s test, 95% CI. Dotted lines delimit the wound bed. 
analysed (top) for marker-positive EPF quantification (bottom). Data are Scale bars, 200 µm.
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Extended Data Fig. 5 | Differential expression of classical markers on fascia 
and dermal fibroblasts. a, Gating strategy for fibroblast (Lin−) cytometry.
b, Histogram plots of fibroblast-marker expression in fascia- or dermis-derived
fibroblasts from three biological replicates. c, Fraction of marker-positive cells 
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(Lin” GFP’) sorting and detection of LY6A, PDGFR1, DPP4 and ITGB1 expression. 
Representative plots of three biological replicates. 
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Extended Data Fig. 7 | Fascia matrix forms the eschar and is remodelled in 
the wound. a, Left, in situ matrix tracing and collagen I, III and VI
immunolabelling at defined time points after wounding. Representative
images of three biological replicates. Right, subsampled fractal dimension
maps of the FITC signal of uninjured tissue and at 3 and 7 dpw, and the collagen
signal at 14 dpw. b, Matrix label coverage from total collagen I, III and VI signal in
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(14 dpw) sections analysed from 3 biological replicates. One-way ANOVA, 
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Data exclusions No data was excluded from the analysis. 


Replication All experiments were performed at least three times with similar results. The live-imaging data shown in Fig. 3d-e and Extended Data Fig.
9d-e and h-k were each generated from one representative video.


Randomization Mice were randomly divided into treatment groups. 


Blinding No experiments presented in this study required blinding. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems (n/a | Involved in the study):
Antibodies
Eukaryotic cell lines
Palaeontology
Animals and other organisms
Human research participants
Clinical data

Methods (n/a | Involved in the study):
ChIP-seq
Flow cytometry
MRI-based neuroimaging

Antibodies 


Antibodies used Primary antibodies used in IHC: 
goat-anti-aSMA (Abcam, ab5694, 1:50) 
rabbit-anti-TUBB3 (Abcam, ab18207, 1:100) 
rat-anti-THY1(CD90) (Abcam, ab3105, 1:100) 
rat-anti-CD24 (BD biosciences, 553146, 1:50) 
rabbit-anti-DPP4(CD26) (Abcam, ab28340, 1:150) 
rabbit-anti-PECAM1(CD31) (Abcam, ab28364, 1:10) 
rat-anti-CD34 (Abcam, ab8158, 1:100) 
rabbit-anti-COLLAGEN I (Rockland, 600-401-103-0.1, 1:150)
rabbit-anti-COLLAGEN III (Abcam, ab7778, 1:150) 
rabbit-anti-COLLAGEN VI (Abcam, ab6588, 1:150) 
rabbit-anti-DLK1 (Abcam, ab21682, 1:200) 
rat-anti-ERTR7 (Abcam, ab51824, 1:200) 
rat-anti-F4/80 (Abcam, ab90247, 1:400) 
rabbit-anti-LYVE1 (Abcam, ab14917, 1:100) 
rat-anti-MOMA2 (Abcam, ab33451, 1:100)
goat-anti-PDGFRA (R&D systems, AF1062, 1:50) 
rat-anti-LY6A (Biolegend, 122502, 1:150) 
rat-anti-CD44 (Abcam, ab119348, 1:100) 
rabbit-anti-NOV/CCN3 (Elabscience, E-AB-15124, 1:20) 
sheep-anti-FAP (R&D systems, AF3715, 1:100) 


Secondary antibodies:


PacificBlue-, AlexaFluor488-, AlexaFluor568- or AlexaFluor647-conjugated secondary antibodies against suitable species (Life
Technologies, 1:500)


Antibodies used for flow cytometry: 

anti-DLK1 (Abcam, ab21682, 1:200) 

FITC-anti-CD9 (eBioscience, 11-0091-81, 1:200) 

anti-CD271(LNGFR) (Miltenyi, 130-110, 1:100) 

APC-anti-F4/80 (eBioscience, 17-4801-82, 1:200) 
AlexaFluor790-anti-NG2 (Santa Cruz, sc-166251 AF790, 1:100) 
FITC-anti-DPP4(CD26) (BioLegend, 137806, 1:200) 
PerCP-eFluor710-anti-ITGB1(CD29) (eBioscience, 46-0291-80, 1:200) 
FITC-anti-CD34 (eBioscience, 11-0341-82, 1:200)
PerCP-Cy5.5-anti-CD24 (eBioscience, 45-0242-80, 1:200) 
APC-anti-ITGA7 (Miltenyi Biotec, 130-103-356, 1:100) 
PerCP-Cy5.5-anti-LY6A(Sca1) (eBioscience, 45-5981-80, 1:200) 
PE-Vio770-anti-PDGFRA (Miltenyi, 130-105-117, 1:100)
PerCP-Vio700-anti-CD146 (Miltenyi, 130-103-865, 1:100)
APC-anti-PECAM1(CD31) (BioLegend, 102410, 1:200)
eFluor660-anti-LYVE1 (eBioscience, 50-0443-82, 1:200)
APC-LY76(TER119) (BioLegend, 116212, 1:200)
APC-anti-EPCAM(CD326) (BioLegend, 118214, 1:200)
APC-anti-PTPRC(CD45) (BioLegend, 103112, 1:200)


Validation See manufacturers’ notes. Antibodies were additionally validated using respective isotype antibodies in immunofluorescence 
assays. 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals The following mouse strains were used, both males and females, adult at 8-12 weeks old or neonatal at postnatal days P0-P2:
C57BL/6J wild type 
En1Cre (En1tm2(cre)Wrst) 
R26mTmG (Gt(ROSA)26Sortm4(ACTB-tdTomato,-EGFP)Luo) 
R26VT2/GK3 aka "Rainbow" (Gt(ROSA)26Sortm1(CAG-EGFP,-mCerulean,-mOrange,-mCherry)Ilw) 
R26iDTR (Gt(ROSA)26Sortm1(HBEGF)Awai) 
RAG2-/- (Rag2tm1.1Cgn) 
Fox Chase SCID (CB17/Icr-Prkdcscid/IcricoCrl) 


Wild animals The study did not involve wild animals. 
Field-collected samples The study did not involve samples collected from the field.
Ethics oversight Government of Upper Bavaria, Germany


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Healthy donors between 18 and 65 years of age, of both genders, who underwent surgery for skin tumors.


Recruitment Fresh human skin and scar biopsies, from various anatomic locations, were collected through the Department of Dermatology 
and Allergology, Klinikum rechts der Isar, Technical University Munich. Informed consent was obtained from all subjects prior to 
skin biopsies. 

Ethics oversight Ethikkommission der Bayerischen Landesärztekammer, reference number 85/185.


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a ‘group’ is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 


Sample preparation Fascia and dermis were physically separated from the back-skin of C57BL6/J or En1Cre;R26mTmG mice under the fluorescence
stereomicroscope as before. Harvested tissue was minced with surgical scissors and digested with an enzymatic cocktail
containing 1 mg/ml Collagenase IV, 0.5 mg/ml Hyaluronidase, and 25 U/ml DNase I (Sigma-Aldrich) at 37 °C for 30 min. The
resulting single-cell suspension was filtered and incubated with conjugated/unconjugated primary antibodies (dilution 1:200) at
4 °C for 30 min, followed by an incubation with a suitable secondary antibody when needed at 4 °C for 30 min. Cells were
washed and stained with Sytox blue dye (dilution 1:1,000; Life Technologies, S34857) for dead cell exclusion.
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Instrument Cells were subjected to flow cytometric analysis using a FACSAria III (BD Bioscience) 
Software Data was collected using the BD FACSDIVA software (BD Bioscience) and analyzed using Summit software (Cytomation) 


Cell population abundance The purity of sorted cells was determined by flow cytometric analysis of the sorted cells with the same gating strategy as during
sorting.


Gating strategy The single cells were gated based on FSC/SSC and FSC-Area/FSC-width. The viable cells were gated within the SytoxBlue-negative gate,
and fibroblasts were gated within the lineage-negative (APC-negative) gate (antibodies against lineage markers are APC-conjugated
anti-CD45, anti-Ter119, anti-EpCAM, anti-CD31, anti-Lyve1, and anti-Tie2).
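
The gating hierarchy described here is a sequence of nested filters. The sketch below illustrates that order only. It assumes simple rectangular cut-offs, whereas real gates are drawn on the plots, and every threshold, field name and function name is a hypothetical placeholder rather than a value from the study.

```python
from dataclasses import dataclass


@dataclass
class Event:
    fsc_area: float    # forward scatter, area
    fsc_width: float   # forward scatter, width
    ssc_area: float    # side scatter, area
    sytox_blue: float  # dead-cell dye intensity
    apc: float         # pooled lineage-marker intensity (APC channel)


@dataclass
class Thresholds:
    # Hypothetical cut-offs chosen only to make the example runnable.
    fsc_area_min: float = 200.0
    ssc_area_max: float = 800.0
    fsc_width_max: float = 150.0
    sytox_blue_max: float = 100.0
    apc_max: float = 100.0


def gate_fibroblasts(events, thresholds=None):
    """Apply the gates in order: scatter, singlets, viable, lineage-negative."""
    t = thresholds or Thresholds()
    # 1. FSC/SSC gate removes debris.
    cells = [e for e in events
             if e.fsc_area >= t.fsc_area_min and e.ssc_area <= t.ssc_area_max]
    # 2. FSC-Area/FSC-width gate keeps single cells.
    singlets = [e for e in cells if e.fsc_width <= t.fsc_width_max]
    # 3. SytoxBlue-negative gate keeps viable cells.
    viable = [e for e in singlets if e.sytox_blue < t.sytox_blue_max]
    # 4. APC-negative gate keeps lineage-negative cells (fibroblasts).
    return [e for e in viable if e.apc < t.apc_max]
```

Because each step filters the output of the previous one, the order of the gates matches the order in which they are listed in the gating strategy.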


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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Chimeric antigen receptor (CAR) T cells mediate anti-tumour effects in a small subset 
of patients with cancer, but dysfunction due to T cell exhaustion is an important
barrier to progress. To investigate the biology of exhaustion in human T cells
expressing CAR receptors, we used a model system with a tonically signalling CAR,
which induces hallmark features of exhaustion. Exhaustion was associated with a
profound defect in the production of IL-2, along with increased chromatin
accessibility of AP-1 transcription factor motifs and overexpression of the bZIP and
IRF transcription factors that have been implicated in mediating dysfunction in
exhausted T cells. Here we show that CAR T cells engineered to overexpress the
canonical AP-1 factor c-Jun have enhanced expansion potential, increased functional
capacity, diminished terminal differentiation and improved anti-tumour potency in
five different mouse tumour models in vivo. We conclude that a functional deficiency
in c-Jun mediates dysfunction in exhausted human T cells, and that engineering CAR
T cells to overexpress c-Jun renders them resistant to exhaustion, thereby addressing
a major barrier to progress for this emerging class of therapeutic agents.


CAR-expressing T cells demonstrate impressive response rates in B
cell malignancies, but fewer than 50% of patients experience long-term
disease control and CAR T cells have not mediated sustained
responses in solid tumours. Several factors limit the efficacy of CAR
T cells, including a requirement for high antigen density for optimal
CAR function enabling rapid selection of antigen-loss or antigen-low
variants, the suppressive tumour microenvironment and intrinsic
T cell dysfunction due to T cell exhaustion. T cell exhaustion
has been increasingly incriminated as a cause of CAR T cell dysfunction,
raising the prospect that engineering exhaustion-resistant
CAR T cells could improve clinical outcomes.

T cell exhaustion is characterized by high expression of inhibitory
receptors and widespread transcriptional and epigenetic alterations,
but the mechanisms responsible for impaired function
in exhausted T cells are unknown. Blockade of PD-1 can reinvigorate
some exhausted T cells but does not restore function fully, and trials
using PD-1 blockade in combination with CAR T cells have not demonstrated
efficacy. In a model in which healthy T cells are driven to
exhaustion by the expression of a tonically signalling CAR, exhausted
human T cells demonstrated widespread epigenomic dysregulation
of AP-1 transcription factor-binding motifs and increased expression
of the bZIP and IRF transcription factors that have been implicated in
the regulation of exhaustion-related genes. Therefore, we tested the
hypothesis that dysfunction in this setting resulted from an imbalance
between activating and immunoregulatory AP-1-IRF complexes by
inducing overexpression of c-Jun, an AP-1 family transcription factor
associated with productive T cell activation. Consistent with this
hypothesis, overexpression of c-Jun rendered CAR T cells resistant to
exhaustion, as demonstrated by enhanced expansion potential in vitro
and in vivo, increased functional capacity, diminished terminal differentiation
and improved anti-tumour potency in multiple in vivo models.


HA-28z CAR rapidly induces T cell exhaustion 


Exhaustion in human T cells was recently demonstrated after expression
of a CAR incorporating the disialoganglioside (GD2)-specific
14g2a scFv, CD3ζ and CD28 signalling domains (GD2-28z), as a result
of tonic signalling mediated via antigen-independent aggregation.
Here we show that CARs incorporating the 14g2a-E101K scFv, which
demonstrate higher affinity for GD2 (HA-28z), display a more severe
exhaustion phenotype (Extended Data Fig. 1a-c). In contrast to CD19-28z
CAR T cells (without tonic signalling), HA-28z CAR T cells develop
profound features of exhaustion, including reduced expansion in culture,
increased expression of inhibitory receptors, exaggerated effector
differentiation, and diminished IFNγ and markedly decreased IL-2
production after stimulation (Fig. 1a-d, Extended Data Fig. 1d, e). The
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Fig. 1| HA-28z CART cells manifest phenotypic, functional, transcriptional 
and epigenetic hallmarks of T cell exhaustion. a, Primary T cell expansion.
Data are mean ± s.e.m. from n = 10 independent experiments. b, Surface
expression of exhaustion-associated markers. c, Surface expression of CD45RA
and CD62L to distinguish T memory stem cells (CD45RA+CD62L+), central
memory cells (CD45RA−CD62L+), and effector memory cells (CD45RA−CD62L−).
d, IL-2 (left) and IFNγ (right) release after 24-h co-culture with CD19+GD2+
Nalm6-GD2 leukaemia cells. Data are mean ± s.d. from triplicate wells. In b-d,
one representative donor (of n = 10 experiments) is shown for each assay.
P values determined by unpaired two-tailed t-tests. e, Principal component
analysis (PCA) of global transcriptional profiles of naive- and central-memory-
derived CD19-28z (CD19) or HA-28z (HA) CAR T cells at days 7, 10 and 14 in


functional defects are due to exhaustion-associated dysfunction rather 
than suboptimal interaction of the HA-28z CAR with its target GD2, 
because they are also observed in CD19-28z CAR T cells when HA-28z 
CAR is co-expressed using a bi-cistronic vector (Extended Data Fig. 1f).
Principal component analysis (PCA) of RNA-sequencing (RNA-seq) data 
demonstrated that the strongest driver of transcriptional variance was 
the presence of the exhausting HA-28z versus control CD19-28z CAR 
(Fig. 1e), although some cell-type-specific differences were observed
(Extended Data Fig. 1g). 

The top 200 most differentially expressed genes (Fig. 1f, Supplementary
Table 1) included activation-associated genes (IFNG, GZMB
and IL2RA), inhibitory receptors (LAG3 and CTLA4), inflammatory
chemokines or cytokines (CXCL8, IL13 and IL1A), and genes associated
with naive and memory T cells (IL7R, TCF7, LEF1 and KLF2), which overlapped
with gene sets described in chronic lymphocytic choriomeningitis
virus (LCMV) mouse models (Extended Data Fig. 1h). Single-cell
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culture. PC1 (39.3% variance) separates CD19-28z from HA-28z CART cells. 

f, Gene expression of the top 200 genes driving PC1. Genes of interest in each
cluster are listed. g, Differentially accessible chromatin regions (peaks) in CD8+
CD19-28z and HA-28z CAR T cells. Both naive and central memory cell subsets
are incorporated for each CAR. h, PCA of ATAC-seq chromatin accessibility in
CD19-28z or HA-28z CAR T cells. PC1 (76.9% variance) separates CD19-28z from
HA-28z CAR samples. i, Global chromatin accessibility profile of CD4+ and
CD8+ CD19-28z and HA-28z CAR T cells derived from naive (N) and central
memory (CM) subsets. Top 5,000 peaks. j, Differentially accessible enhancer
regions in CD19- and HA-28z CAR T cells in the CTLA4 (top) or IL7R (bottom) loci.
Unless noted otherwise, all analyses were done on day 10 of culture. GITR is also
known as TNFRSF18.


RNA-seq analysis of GD2-28z versus CD19-28z CAR T cells revealed
differential gene expression similar to that of HA-28z CAR T cells (Extended
Data Fig. 2).

T cell exhaustion is associated with widespread epigenetic
changes. Using ATAC-seq (assay for transposase-accessible chromatin
using sequencing) analysis (Fig. 1g, Extended Data Figs. 3, 4a),
we observed that CD8+ HA-28z CAR T cells displayed more than 20,000
unique differentially accessible chromatin regions (peaks) compared
with fewer than 3,000 unique peaks in CD8+ CD19-28z CAR T cells (false
discovery rate (FDR) < 0.1 and log2-transformed fold change > 1). Principal
component analysis (PCA) revealed HA-28z versus CD19-28z CAR as
the strongest driver of differential chromatin states (PC1 variance 79.6%,
Fig. 1h), with weaker but observable differences between
naive and central memory cells (PC2 variance 7.4%), and CD4 versus
CD8 subsets (PC3 variance 6.5%; Extended Data Fig. 4b). Clustering the
top 5,000 differentially accessible regions revealed a similar epigenetic
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Fig. 2| AP-1 family signature in exhausted CART cells. a, Top 25 transcription 
factor motif deviation scores in day 10 HA-28z (HA) versus CD19-28z (CD19) CAR
T cells by chromVAR analysis. b, Top transcription factor (TF) motifs enriched
in naive CD8+ HA-28z CAR T cells. c, Bulk RNA-seq expression (fold change
HA/CD19) of indicated AP-1-bZIP and IRF family members in CD19-28z (black)
and HA-28z (blue) CAR T cells. Data are mean ± s.e.m. from n = 6 samples across
three donors. *P < 0.05, **P < 0.01, ***P < 0.001, two-tailed ratio t-tests
(see Supplementary Information for exact P values). d, Increased protein


state in HA-28z CAR T cells regardless of the starting subset (Fig. 1i). HA-28z
CAR T cells demonstrated increased chromatin accessibility near
exhaustion-associated genes such as CTLA4, and decreased accessibility
near memory-associated genes such as IL7R (Fig. 1j). Together, these
data suggest that tonically signalling CAR T cells are a valid model for
the study of human T cell exhaustion.
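
The thresholds quoted for differential accessibility (FDR < 0.1 and log2-transformed fold change > 1) amount to a two-condition filter. The sketch below is illustrative only. It assumes each peak has already been assigned a log2 fold change (HA-28z over CD19-28z) and an FDR by an upstream differential-accessibility tool, and the function name, tuple layout and symmetric treatment of negative fold changes are assumptions rather than the authors' pipeline.

```python
def split_differential_peaks(peaks, fdr_cutoff=0.1, log2fc_cutoff=1.0):
    """Split peaks into those more accessible in HA-28z or in CD19-28z.

    `peaks` is an iterable of (peak_id, log2_fold_change, fdr) tuples,
    where the fold change is assumed to be HA-28z relative to CD19-28z.
    """
    more_open_in_ha, more_open_in_cd19 = [], []
    for peak_id, log2fc, fdr in peaks:
        if fdr >= fdr_cutoff:
            continue  # not significant at the chosen FDR
        if log2fc > log2fc_cutoff:
            more_open_in_ha.append(peak_id)
        elif log2fc < -log2fc_cutoff:
            more_open_in_cd19.append(peak_id)
    return more_open_in_ha, more_open_in_cd19
```

Counting the two returned lists gives the kind of comparison reported in the text, in which the number of peaks unique to one CAR is set against the number unique to the other.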


Epigenetic and transcriptional dysregulation of AP-1 


Using chromVAR and transcription factor motif enrichment analysis
to identify transcriptional programs associated with the epigenetic
changes observed, we discovered that the AP-1-bZIP and bZIP-IRF binding
motifs were the most significantly enriched in exhausted CAR T cells
(Fig. 2a, b, Extended Data Fig. 4c, d), with strong enrichment of NF-κB,
NFAT and RUNX transcription factor motifs in some clusters, reproducing
epigenetic signatures of exhaustion observed in other models.
Paired RNA-seq analysis across three donors revealed increased bZIP
and IRF mRNA in HA-28z versus CD19-28z CAR T cells, most significantly
for JUNB, FOSL1, BATF, BATF3, ATF3, ATF4 and IRF4 (Fig. 2c, Extended
Data Fig. 4e). We confirmed increased protein expression of JunB, IRF4
and BATF3, with higher relative levels of the BATF and IRF4 transcription
factors than the canonical AP-1 factor c-Jun (Fig. 2d, Extended
Data Fig. 4f). Transcription factors of the AP-1 family form homo- and
heterodimers through interactions in the common bZIP domain that
compete for binding at DNA elements containing core TGA-G/C-TCA
consensus motifs and can complex with IRF transcription factors.
The classic AP-1 heterodimer c-Fos-c-Jun drives transcription of IL2,
whereas complexes containing other AP-1 and IRF family members can
antagonize c-Jun activity and/or drive immunoregulatory gene expression
in T cells. Co-immunoprecipitation analysis demonstrated




expression of c-Jun, JunB, BATF3 and IRF4 in HA-28z versus CD19-28z CAR
T cells at days 7, 10 and 14 of culture determined by immunoblotting.
e, Immunoprecipitation of c-Jun and JunB complexes demonstrates that
HA-28z CAR T cells contain more c-Jun heterodimers, as well as JunB-IRF4,
JunB-BATF and JunB-BATF3 heterodimers, than CD19-28z CAR T cells.
f, Correlation network of exhaustion-related transcription factors in naive-
derived CD4+ GD2-28z CAR T cells using single-cell RNA-seq analysis. For gel
source data, see Supplementary Fig. 1.


increased levels of complexed JunB, BATF, BATF3 and IRF4 in HA-28z 
CAR T cells (Fig. 2e) and single-cell RNA-seq analysis of CD19-28z and 
GD2-28z CAR T cells confirmed that the bZIP family members JUN,
JUNB, JUND and ATF4 were among the most differentially expressed
and broadly connected in exhausted GD2-28z CAR T cell networks
(Fig. 2f, Extended Data Figs. 2, 4g). We observed a similar pattern of
AP-1 and BATF/IRF4 imbalance in a single-cell gene expression dataset
from patients with metastatic melanoma undergoing treatment with
immune checkpoint blockade (Extended Data Fig. 4h).


c-Jun overexpression prevents CAR T cell exhaustion 


We hypothesized that T cell dysfunction in exhausted cells might be
due to a relative deficiency in c-Jun-c-Fos AP-1 heterodimers. Indeed,
HA-28z CAR T cells overexpressing AP-1 demonstrated increased production
of IL-2, which required c-Jun but not c-Fos (Extended Data
Fig. 4i-m), whereas no benefit was observed in CD19-28z CAR T cells.
To further investigate c-Jun overexpression in exhausted T cells, we
created JUN-P2A-CAR bi-cistronic vectors (Fig. 3a, b) and demonstrated
enhanced c-Jun N-terminal phosphorylation (JNP) only in JUN-HA-28z
(Fig. 3c), consistent with JNK kinase activation via the HA-28z-associated
tonic signal. Antigen-stimulated JUN-HA-28z CAR T cells demonstrated
remarkably increased production of IL-2 and IFNγ, although
no significant differences were observed in JUN-CD19-28z CAR T cells
(Fig. 3d, e, Extended Data Fig. 5b, c). We also observed enhanced functional
activity of JUN-HA-28z CAR T cells at the single-cell level in both
CD4⁺ and CD8⁺ CAR T cells, and c-Jun overexpression increased the
frequency of stem-cell-memory and central-memory versus effector
and effector-memory subsets in CD4⁺ and CD8⁺ populations of
exhausted CAR T cells, but not in healthy CAR T cells (Fig. 3f, Extended
Fig. 3 | c-Jun overexpression enhances the function of exhausted CAR T cells.
a, JUN-P2A-HA-CAR expression vector. HTM, hinge/transmembrane; ICD,
intracellular domain. b, Intracellular c-Jun expression in control (Ctrl) and JUN
CAR T cells at day 10 by flow cytometry. Grey denotes isotype control.
c, Immunoblot for total c-Jun and phosphorylated c-Jun (p-c-Jun(S73)) in
control (Ctrl) and JUN CAR T cells at day 10. d, e, IL-2 (d) and IFNγ (e) production
after 24-h co-culture of control or JUN CD19-28z and HA-28z CAR T cells in
response to antigen-positive tumour cells. Data are mean ± s.d. of triplicate
wells. P values determined by unpaired two-tailed t-tests. One representative
donor. Fold change across n = 8 donors in Extended Data Fig. 5. f, Left, flow
cytometry showing representative expression of CD45RA and CD62L in control
and JUN CAR T cells at day 10. Right, relative frequency of effector


Data Fig. 5d, e). Together, the data are consistent with a model in which 
c-Jun overexpression is functionally more significant in exhausted 
T cells, which express higher levels of immunomodulatory bZIP and 
IRF transcription factors. 

Overexpression of c-Jun also enhanced long-term proliferative capacity,
which is associated with anti-tumour effects in solid tumours, in
CAR T cells without tonic signalling (CD19-28z, CD19-BBz) (Extended
Data Fig. 5f). Enhanced proliferation remained IL-2-dependent, as
expansion immediately ceased after IL-2 withdrawal (Fig. 3g, Extended
Data Fig. 5g). Expanding CD8⁺ JUN-CD19-28z CAR T cells displayed
diminished exhaustion markers and an increased frequency of cells
bearing the stem-cell memory phenotype compared with control CD19-28z
CAR T cells (Extended Data Fig. 5h-j). c-Jun overexpression also
increased homeostatic expansion of both CD19-28z and CD19-BBz CAR
T cells in tumour-free NSG (NOD-SCID Il2rg-null) mice (Fig. 3h), which
led to accelerated GVHD in the JUN-CD19-BBz CAR-T-cell-treated mice.
Together, the data demonstrate that c-Jun overexpression mitigates
T cell exhaustion in numerous CARs tested, including those incorporating
CD28 or 4-1BB costimulatory domains, and regardless of whether
exhaustion is driven by long-term expansion or tonic signalling.


Molecular mechanisms of c-Jun in exhaustion 


To explore the mechanism by which c-Jun overexpression prevents T cell 
dysfunction, we compared ATAC-seq and RNA-seq results of HA-28z 
and JUN-HA-28z CAR T cells. Overexpression of c-Jun did not change 


(E; CD45RA⁺CD62L⁻), stem-cell memory (SCM; CD45RA⁺CD62L⁺), central
memory (CM; CD45RA⁻CD62L⁺), and effector memory (EM; CD45RA⁻CD62L⁻) in
CD8⁺ control or JUN-HA-28z CAR T cells (n = 6 donors from independent
experiments). Lines indicate paired samples from the same donor. P values
determined by paired two-tailed t-tests. g, On day 39, 1 × 10° viable T cells from
Extended Data Fig. 5f were re-plated and cultured for 7 days with or without
IL-2. h, Control or JUN CD19-28z or CD19-BBz CAR T cells from g were
cryopreserved on day 10 and later thawed, rested overnight in IL-2 and 5 × 10°
cells were then injected intravenously into healthy NSG mice. On day 25 after
infusion, peripheral blood T cells were quantified by flow cytometry. Data are
mean ± s.e.m. of n = 5 mice per group. P values determined by unpaired
two-tailed t-tests.


the epigenetic profile but substantially modulated the transcriptome, 
with 319 genes differentially expressed in JUN and control HA-28z CAR
T cells (Extended Data Fig. 6a-c), including reduced expression of
exhaustion-associated genes and increased expression of memory 
genes. Using DAVID, we confirmed that genes changed by c-Jun are 
highly enriched for AP-1 family binding sites (Extended Data Fig. 6d), 
which suggests that gene expression changes were mediated by AP-1 
family transcription factors. 

We postulated that c-Jun overexpression could rescue exhausted
T cells by direct transcriptional activation of AP-1 target genes and/or
by indirectly disrupting immunoregulatory AP-1-IRF transcriptional
complexes that drive exhaustion-associated gene expression (AP-1i)
(Extended Data Fig. 6e). To test these non-mutually-exclusive hypotheses,
we first evaluated a panel of c-Jun mutants predicted to be deficient
in transcriptional activation (JUN-AA, JUN-Δδ, JUN-ΔTAD), DNA binding
(JUN-Δbasic) or dimerization (JUN-ΔLeu, JUN-ΔbZIP) (Fig. 4a,
Extended Data Fig. 6f). JUN-AA and JUN-Δδ both equivalently increased
IL-2 and IFNγ production compared to wild-type c-Jun in HA-28z CAR
T cells (Fig. 4b), whereas JUN-ΔTAD demonstrated partial rescue in
IL-2 production. Conversely, C-terminal mutants (JUN-Δbasic, JUN-ΔLeu
and JUN-ΔbZIP), which were unable to bind chromatin (Extended
Data Fig. 6g), did not rescue cytokine production in exhausted HA-28z
CAR T cells (Fig. 4b). Furthermore, c-Jun overexpression substantially
decreased levels of AP-1i, as evidenced by diminished mRNA levels
(Extended Data Fig. 6h), reduced total and chromatin-bound JunB, BATF
and BATF3 proteins, and reduced JunB-BATF complexes (Extended Data
Fig. 4 | c-Jun functional rescue of exhaustion requires bZIP dimerization but
is independent of transactivation. a, Schematic of c-Jun protein showing
N-terminal transactivation domain (TAD) and C-terminal bZIP domain deletion
mutants. Red asterisks denote JNP sites at Ser63 and Ser73 mutated to alanine
in JUN-AA. b, IL-2 (left) and IFNγ (right) production by control or JUN-HA-28z
CAR T cells expressing the indicated c-Jun variant after 24-h stimulation with
Nalm6-GD2 (N6-GD2) or 143B target cells. Data are mean ± s.d. of triplicate
wells; representative of three independent experiments. c, Immunoblot of
indicated AP-1 and IRF proteins in control, JUN-WT or JUN-ΔbZIP HA-28z CAR


Fig. 7a-c). Importantly, c-Jun-mediated displacement of JunB, BATF 
and BATF3 from chromatin and reduced JunB-BATF complexes were 
dependent on the ability of c-Jun to partner with AP-1 family members 
(Fig. 4c, d). Consistent with this, c-Jun and IRF4 chromatin immuno- 
precipitation followed by high-throughput sequencing (ChIP-seq) 
analysis identified no novel c-Jun-binding sites after c-Jun overexpres- 
sion. Instead, the vast majority of sites bound by c-Jun are also bound 
by IRF4 (and probably BATF), consistent with c-Jun overexpression 
increasing binding almost exclusively at AP-1-IRF composite elements, 
including near exhaustion-associated genes regulated by IRF4, and 
genes associated with increased T cell proliferation and functional 
activation (Extended Data Fig. 7d-h). Finally, JunB-knockout, BATF- 
knockout and especially IRF4-knockout significantly increased IL-2 and 
IFNγ production in HA-28z CAR T cells (Fig. 4e, Extended Data Fig. 7i, j).
Time-course experiments using a drug-regulatable expression model
of c-Jun revealed that full rescue required c-Jun overexpression during 
both T cell expansion and antigen stimulation (Extended Data Fig. 8), 
consistent with a model in which c-Jun overexpression both modulates 
molecular reprogramming during the development of exhaustion 
and augments responses during acute stimulation downstream of 
antigen encounter. Together, the data are consistent with a model in 
which an overabundance of AP-1-IRF complexes drives the exhaustion 
transcriptional program and c-Jun overexpression prevents exhaustion 
by decreasing and/or displacing AP-1i complexes from chromatin. 


JUN CAR T cells enhance anti-tumour activity in vivo 


Using a Nalm6-GD2⁺ leukaemia model, we confirmed functional superiority
of JUN-HA-28z CAR T cells in vivo (Fig. 5a-c), which required c-Jun
dimerization but not transactivation (Extended Data Fig. 7k, l). In an
in vitro model of limiting antigen dilution, JUN-HA-28z CAR T cells produced
greater maximal IL-2 and IFNγ and manifested a lower threshold
for antigen-induced IL-2 secretion (Fig. 5d, e). Limiting target antigen


T cells in soluble or chromatin-bound lysis fractions. d, Immunoblot of
indicated AP-1 and IRF proteins in control, JUN-WT or JUN-ΔbZIP HA-28z CAR
T cells in total lysate (right) or after JunB immunoprecipitation (IP, left). e, Fold
change (FC) in IL-2 (top) and IFNγ (bottom) production in AP-1 or IRF4
CRISPR-knockout (KO) HA-28z CAR T cells after 24-h stimulation with Nalm6-GD2 or
143B target cells. Fold change in cytokine production is normalized to control
HA-28z CAR T cells. Data are mean ± s.e.m. of n = 6 independent experiments.
P values determined using nonparametric Mann-Whitney U tests.


expression is increasingly recognized to limit CAR functionality, as
observed after treatment with CD22-BBz CAR T cells in patients with
relapsed or refractory leukaemia. We therefore assessed whether
c-Jun overexpression could enhance the capacity to target antigen-low
tumour cells. In response to CD22ˡᵒʷ leukaemia, JUN-CD22-BBz CAR
T cells exhibited increased cytokine production in vitro (Fig. 5f-h) and
markedly increased anti-tumour activity in vivo (Fig. 5i-l) compared
with control CD22-BBz CAR T cells. Similar results were observed in a
CD19ˡᵒʷ Nalm6 leukaemia model (Extended Data Fig. 9a-f).


c-Jun decreases hypofunction within solid tumours 


c-Jun overexpression also enhanced the functionality of CARs targeting 
solid tumours. JUN-Her2-BBz CAR T cells prevented 143B osteosarcoma 
tumour growth in vivo, markedly improved long-term survival, and 
greatly increased T cell expansion (Extended Data Fig. 9g-i). Similar 
results were observed when comparing GD2-BBz and JUN-GD2-BBz 
CAR T cells against 143B (Extended Data Fig. 9j-n). c-Jun overexpression
increased the frequency of total and CAR⁺ Her2-BBz T cells within
tumours (Fig. 6a, b), reduced expression of exhaustion markers PD-1
and CD39 (Fig. 6c), and substantially increased cytokine production
after ex vivo re-stimulation (Fig. 6d, e, Extended Data Fig. 10a-c).
Single-cell RNA-seq of purified tumour-infiltrating JUN-Her2-BBz CAR
T cells demonstrated increased frequency of cells within the G2/M
and S phases of the cell cycle (Fig. 6f), a more activated transcriptional
program (as measured by IL2RA and CD38), and downregulation of
numerous exhaustion-associated genes (PDCD1, BTLA, TIGIT, CD200,
ENTPD1 and NR4A2) (Fig. 6g and Extended Data Fig. 10d-g). Finally, a
small cluster of T cells characterized by high IL7R expression (IL7R,
KLF2, CD27, TCF7 and SELL) was preserved in tumours treated with
JUN CAR T cells but not those receiving control Her2-BBz CAR T cells
(Extended Data Fig. 10g), consistent with c-Jun-induced maintenance
of a memory-like population capable of self-renewal.
Fig. 5 | JUN-modified CAR T cells increase in vivo activity against leukaemia
and enhance T cell function under suboptimal stimulation. a-c, NSG mice
were injected intravenously with 1 × 10° Nalm6-GD2 leukaemia cells, and then
3 × 10° mock, HA-28z or JUN-HA-28z CAR⁺ T cells were given intravenously on
day 3. a, c, Tumour progression was monitored using bioluminescent imaging.
Scales are normalized for all time points. D, day. b, JUN-HA-28z CAR T cells
induced long-term tumour-free survival. Data are mean ± s.e.m. of n = 5 mice
per group. Reproducible in three independent experiments; however, in some
experiments, long-term survival was diminished owing to outgrowth of GD2(-)
Nalm6 clones. d, e, IL-2 (d) and IFNγ (e) production after 24 h stimulation of
control or JUN HA-28z CAR T cells with immobilized 1A7 anti-CAR idiotype
antibody. Each curve was fit with nonlinear dose response kinetics to
determine half-maximal effective concentration (EC₅₀) values. Smaller graphs
(right) highlight antibody concentrations less than 1 µg ml⁻¹. Data are
mean ± s.d. of triplicate wells; representative of two independent experiments.


Discussion 


Several lines of evidence implicate exhaustion in limiting the potency
of CAR T cells. Using a tonically signalling CAR that can induce the
hallmark features of exhaustion in a controlled in vitro culture system,
we identified AP-1-related bZIP-IRF families as major factors that drive
exhaustion-associated gene expression. We tested the hypothesis that
exhaustion-associated dysfunction results from increased levels of AP-1-IRF
complexes leading to a functional deficiency in activating AP-1 Fos/Jun
heterodimers. Consistent with this model, c-Jun overexpression prevented
phenotypic and functional hallmarks of exhaustion and improved anti-tumour
control in five tumour models, including the clinically relevant
GD2-BBz and Her2-BBz CAR T cells and CD19 CAR T cells subjected to
prolonged ex vivo expansion. JUN CAR T cells also demonstrated increased
potency when encountering tumour cells with low antigen density.
f, JUN-CD22-BBz retroviral vector. g, CD22 surface expression on Nalm6 wild-type
(N6-22ᵂᵀ), Nalm6-CD22-knockout (N6-22ᴷᴼ) and Nalm6-22ᴷᴼ plus CD22ˡᵒʷ
(N6-22ˡᵒʷ) cells. h, IL-2 (left) and IFNγ (right) release after co-culture of Nalm6
and Nalm6-22ˡᵒʷ cells with control or JUN-CD22-BBz CAR T cells. Data are
mean ± s.d. of triplicate wells; representative of three independent
experiments. i-l, NSG mice were inoculated with 1 × 10° Nalm6-22ˡᵒʷ leukaemia
cells intravenously. On day 4, 3 × 10° mock, control or JUN-CD22-BBz CAR⁺
T cells were transferred intravenously. i, l, Tumour growth was monitored by
bioluminescent imaging. j, Mice receiving JUN-CD22-BBz CAR T cells display
increased peripheral blood T cells on day 23. k, Long-term survival of
CAR-treated mice. Data in i and j are mean ± s.e.m. of n = 5 mice per group;
representative of two independent experiments. Unless otherwise noted,
P values determined by unpaired two-tailed t-tests. Survival curves were
compared using the log-rank Mantel-Cox test.


Mechanistically, c-Jun overexpression could work by directly enhancing
c-Jun-mediated transcriptional activation of genes such as IL2, and/or
indirectly by disrupting or displacing AP-1i. Substantial orthogonal
data are consistent with the indirect displacement model. First, the inability
of Fos overexpression to enhance function is consistent with the
displacement model, as Fos has not been described to heterodimerize
with BATF proteins. Second, c-Jun mutant experiments demonstrated
a crucial role for dimerization but not transactivation in the biology
observed. Third, we observed a reduction in total and chromatin-bound
JunB, BATF and BATF3 after c-Jun overexpression, and could reproduce
functional enhancement of exhausted T cells after knockout of IRF4 and
JUNB. An indirect model in which c-Jun blocks access of AP-1i complexes
to enhancer regions is also consistent with the previous finding that
BACH2 protects from terminal effector differentiation by blocking AP-1
sites, as terminal effector differentiation is a hallmark of exhaustion in
Fig. 6 | c-Jun overexpression enhances CAR T cell efficacy and decreases
hypofunction within solid tumours. NSG mice were inoculated with 1 × 10°
143B osteosarcoma cells via intramuscular injection, and then 1 × 10’ mock,
Her2-BBz or JUN-Her2-BBz CAR T cells were given intravenously on day 14.
a, Tumour growth (monitored by caliper measurements). b-g, On day 28, mice
were euthanized and tumour tissue was collected and mechanically
dissociated. Single-cell suspensions were labelled for analysis by flow
cytometry (b, c), re-stimulated with Nalm6-Her2⁺ target cells and analysed for
intracellular cytokine production (d), or sorted by FACS to isolate live, human
CD45⁺ tumour-infiltrating lymphocytes (TILs) (e-g). b, Left, CD8⁺ cells as a
proportion of total live tumour cells. Right, CAR⁺ cells as a proportion of total


our model and is prevented by c-Jun overexpression. Another related
hypothesis suggests that exhaustion results from partner-less NFAT
in the absence of AP-1, and several recent publications implicated
the NFAT-driven transcription factors NR4A and TOX in T cell
exhaustion. Overexpression of NR4A1 was shown to displace chromatin-bound
c-Jun, suggesting that competition with NR4A family members
might also contribute to the effects described here. Future studies are
warranted to understand the functional overlap of NFAT, TOX and NR4A
in c-Jun-overexpressing CAR T cells.

The impressive effects of c-Jun overexpression in several preclinical
tumour models raise the prospect of clinical testing of JUN CAR T cells.
c-Jun is the cellular homologue of the viral oncogene v-Jun, and c-Jun
expression has been described in cancer. However, c-Jun has not
been implicated as an oncogene in mature T cells, which appear to
be generally resistant to transformation, and we see no evidence for
transformation in these studies. Ras-mediated transformation in rodent
models requires JNP; therefore, the JUN-AA mutant, which equally rescues
CAR T cell function, could be implemented to mitigate theoretical
oncogenic risk. Future work is necessary to determine whether c-Jun
overexpression might enhance the risk of other toxicities, including
on-target and off-target effects.

In summary, our findings highlight the power of a deconstructed 
model of human T cell exhaustion to interrogate the biology of this 
complex phenomenon. Using this approach, we discovered a funda- 
mental role for the AP-1-bZIP family in human T cell exhaustion and 
demonstrate that overexpression of c-Jun renders CAR T cells resistant


live CD8⁺ cells. c, PD-1⁺CD39⁺ cells as a frequency of total live CD8⁺ (left) with
representative contour plots (right). d, Frequency of indicated cytokine- or
CD107a-producing cells after 5-h re-stimulation with Nalm6-Her2⁺ target cells.
Gated on total, live CD8⁺ T cells (left) with representative contour plots (right).
e, IL-2 secretion after 24-h re-stimulation of sorted CD45⁺ TILs with
Nalm6-Her2⁺ target cells. Data in a-e are mean ± s.e.m. of n = 6-8 mice per group.
Unless otherwise noted, P values determined by unpaired two-tailed t-tests.
f, Relative frequency of sorted CD45⁺ TILs in each phase of the cell cycle as
determined by single-cell RNA-seq. g, The log₂-transformed fold change in JUN
compared with control Her2-BBz CAR T cells for the indicated transcripts.


to exhaustion, enhances their ability to control tumour growth in vivo, 
and improves the recognition of antigen-low targets, thus addressing 
major barriers to progress with this class of therapeutic agents. 
Methods


Viral vector construction 

MSGV retroviral vectors encoding the following CARs were previously 
described: CD19-28z, CD19-BBz, GD2-28z, GD2-BBz, Her2-BBz and 
CD22-BBz. To create the HA-28z CAR, a point mutation was introduced 
into the 14G2a scFv of the GD2-28z CAR plasmid to create the E101K 
mutation. The ‘4/2NQ’ mutations were introduced into the CH2CH3
domains of the IgG1 spacer region to diminish Fc receptor recognition
for in vivo use of HA-28z CAR T cells. 

Codon-optimized cDNAs encoding c-Jun (JUN), c-Fos (FOS), and 
truncated NGFR (tNGFR; NGFR) were synthesized by IDT and cloned 
into lentiviral expression vectors to create JUN-P2A-FOS, and JUN and 
FOS single expression vectors co-expressing tNGFR under the separate 
PGK promoter. JUN-P2A was then subcloned into the XhoI site of MSGV
CAR vectors using the In-Fusion HD cloning kit (Takara) upstream of
the CAR leader sequence to create JUN-P2A-CAR retroviral vectors. For
JUN-AA, point mutations were introduced to convert Ser63 and Ser73
to Ala. The other JUN mutants were cloned to remove portions of the
protein as described in Fig. 4. The Escherichia coli DHFR-destabilization
domain (DD) sequence was inserted upstream of JUN to create JUN-DD
fusion constructs. In some cases, GFP cDNA was subcloned upstream
of the CAR to create GFP-P2A-CAR vector controls. For bi-cistronic CAR
retroviral vectors, the HA-28z or Her2-28z CAR was cloned downstream
of a codon-optimized CD19-28z CAR to create CD19-28z-P2A-HA-28z
and CD19-28z-P2A-Her2-28z dual CAR expression vectors. 


Viral vector production 

Retroviral supernatant was produced in the 293GP packaging cell 
line as previously described. In brief, 70% confluent 293GP 20-cm
plates were co-transfected with 20 µg MSGV vector plasmid and 10 µg
RD114 envelope plasmid DNA using Lipofectamine 2000. Medium
was replaced at 24 and 48 h after transfection. The 48-h and 72-h viral
supernatants were collected, centrifuged to remove cell debris, and
frozen at −80 °C for future use. Third-generation, self-inactivating
lentiviral supernatant was produced in the 293T packaging cell line.
In brief, 70% confluent 293T 20-cm plates were co-transfected with
18 µg pELNS vector plasmid, and 18 µg pRSV-Rev, 18 µg pMDLg/pRRE
(Gag/Pol) and 7 µg pMD2.G (VSVG envelope) packaging plasmid
DNA using Lipofectamine 2000. Medium was replaced at 24 h after
transfection. The 24-h and 48-h viral supernatants were collected,
combined and concentrated by ultracentrifugation at 28,000 rpm
for 2.5 h. Concentrated lentiviral stocks were frozen at −80 °C for
future use. 


T cell isolation

Healthy donor buffy coats were collected by and purchased from the 
Stanford Blood Center under an IRB-exempt protocol. Primary human 
T cells were isolated using the RosetteSep Human T cell Enrichment 
kit (Stem Cell Technologies) according to the manufacturer’s protocol 
using Lymphoprep density gradient medium and SepMate-50 tubes.
Isolated T cells were cryopreserved at 2 x 10’ T cells per vial in CryoStor 
CS10 cryopreservation medium (Stem Cell Technologies). 


CAR T cell production

Cryopreserved T cells were thawed and activated the same day with Human
T-Expander CD3/CD28 Dynabeads (Gibco) at 3:1 beads:cell ratio in T cell
medium (AIMV supplemented with 5% fetal bovine serum (FBS), 10 mM
HEPES, 2 mM GlutaMAX, 100 U ml⁻¹ penicillin and 100 µg ml⁻¹ streptomycin
(Gibco)). Recombinant human IL-2 (Peprotech) was provided
at 100 U ml⁻¹. T cells were transduced with retroviral vector on days 2
and 3 after activation and maintained at 0.5 × 10°-1 × 10° cells per ml
in T cell medium with IL-2. Unless otherwise indicated, CAR T cells
were used for in vitro assays or transferred into mice on days 10-11
after activation.


Retroviral transduction 

Non-tissue culture treated 12-well plates were coated overnight at 4 °C 
with 1 ml Retronectin (Takara) at 25 µg ml⁻¹ in PBS. Plates were washed
with PBS and blocked with 2% BSA for 15 min. Thawed retroviral supernatant
was added at approximately 1 ml per well and centrifuged for 2 h
at 32 °C at 3,200 rpm before the addition of cells.


CRISPR knockout 

CRISPR-Cas9 gene knockout was performed by transient Cas9/gRNA 
(RNP) complex electroporation using the P3 Primary Cell 4D-Nucleofector
X Kit S (Lonza). On day 4 of culture, HA-28z CAR T cells were counted,
pelleted and resuspended in P3 buffer at 1.5 × 10°-2 × 10° cells per 20 µl
reaction. 3.3 µg Alt-R S.p. Cas9 protein (IDT) and 40 pmol chemically
modified synthetic sgRNA (Synthego) (2:1 molar ratio gRNA:Cas9)
per reaction were pre-complexed for 10 min at room temperature to
create ribonucleoprotein complexes (RNP). A 20-µl cell suspension
was mixed with RNP and electroporated using the EO-115 protocol
in 16-well cuvette strips. Cells were recovered at 37 °C for 30 min in
200 µl T cell medium then expanded as described above. Knockdown
efficiency was determined using TIDE and/or immunoblot. Control
HA-28z CAR T cells were electroporated with a gRNA targeting the
safe-harbour locus AAVS1. The following gRNA target sequences were used:
AAVS1, GGGGCCACTAGGGACAGGAT; JUNB, ACTCCTGAAACCGAGCCTGG;
BATF, TCACTGCTGTCGGAGCTGTG; BATF3, CGTCCTGCAGAGGAGCGTCG;
and IRF4, CGGAGAGTTCGGCATGAGCG.
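
The stated 2:1 gRNA:Cas9 molar ratio can be checked with simple arithmetic. The sketch below converts the Cas9 mass to picomoles and compares it with the sgRNA amount. The Cas9 molecular weight of roughly 160 kDa is an assumption that is not given in the text, and the function names are illustrative only.

```python
# Assumed molecular weight of S. pyogenes Cas9 protein (~160 kDa).
# This value is NOT stated in the Methods; adjust it to the product datasheet.
CAS9_MW_G_PER_MOL = 160_000.0


def micrograms_to_pmol(mass_ug: float, mw_g_per_mol: float) -> float:
    """Convert a protein mass in micrograms to picomoles."""
    # 1 ug = 1e-6 g and 1 mol = 1e12 pmol, so pmol = ug * 1e6 / MW.
    return mass_ug * 1e6 / mw_g_per_mol


def grna_to_cas9_ratio(grna_pmol: float, cas9_ug: float) -> float:
    """Molar ratio of gRNA to Cas9 for one electroporation reaction."""
    return grna_pmol / micrograms_to_pmol(cas9_ug, CAS9_MW_G_PER_MOL)


if __name__ == "__main__":
    cas9_pmol = micrograms_to_pmol(3.3, CAS9_MW_G_PER_MOL)
    ratio = grna_to_cas9_ratio(40.0, 3.3)
    print(f"Cas9: {cas9_pmol:.1f} pmol; gRNA:Cas9 = {ratio:.2f}:1")
```

Under that assumed molecular weight, 3.3 µg of Cas9 is about 20.6 pmol, so 40 pmol of sgRNA corresponds to a ratio close to 2:1, in line with the ratio quoted above.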


Cell lines

The Kelly neuroblastoma, EW8 Ewing’s sarcoma, 143B and TC32 osteosarcoma
cell lines were originally obtained from ATCC. In some cases,
cell lines were stably transduced with GFP and firefly luciferase (GL).
The CD19⁺CD22⁺ Nalm6-GL B-ALL cell line was provided by D. Barrett.
Nalm6-GD2 was created by co-transducing Nalm6-GL with cDNAs for
GD2 synthase and GD3 synthase. The Nalm6-Her2 cell line was created
using lentiviral overexpression of Her2 cDNA. Single-cell clones
were then chosen for high antigen expression. Nalm6-22-knockout
(Nalm6-22ᴷᴼ) and Nalm6-22ᴷᴼ plus CD22ˡᵒʷ (N6-22ˡᵒʷ) have been previously
described and were provided by T. Fry. The Nalm6-CD19ˡᵒʷ
cell lines were created by R. Majzner (manuscript in preparation). All
cell lines were cultured in complete media (CM) (RPMI supplemented
with 10% FBS, 10 mM HEPES, 2 mM GlutaMAX, 100 U ml⁻¹ penicillin,
and 100 µg ml⁻¹ streptomycin (Gibco)). STR DNA profiling of all cell
lines is conducted by Genetica Cell Line testing once per year. None
of the cell lines used in this study is included in the commonly misidentified
cell lines registry. Before using for in vivo experiments, cell
lines were tested with MycoAlert detection kit (Lonza). All cell lines 
tested negative. 


Flow cytometry 

The anti-CD19 CAR idiotype antibody was provided by B. Jena and L. 
Cooper. The 1A7 anti-14G2a idiotype antibody was obtained from NCI-Frederick.
CD22 and Her2 CARs were detected using human CD22-Fc
and Her2-Fc recombinant proteins (R&D). The idiotype antibodies and 
Fc-fusion proteins were conjugated in house with Dylight488 and/or 
650 antibody labelling kits (Thermo Fisher). T cell surface phenotype 
was assessed using the following antibodies: 

From BioLegend: CD4-APC-Cy7 (clone OKT4), CD8-PerCp-Cy5.5 
(clone SK1), TIM-3-BV510 (clone F38-2E2), CD39-FITC or APC-Cy7 (clone 
A1), CD95-PE (clone DX2), CD3-PacBlue (clone HIT3a).

From eBioscience: PD-1-PE-Cy7 (clone eBio J105), LAG-3-PE (clone 
3DS223H), CD45RO-PE-Cy7 (clone UCHL1), CD45-PerCp-Cy5.5 (clone 
HI30). 

From BD: CD45RA-FITC or BV711 (clone HI100), CCR7-BV421 (clone 
150503), CD122-BV510 (clone Mik-β3), CD62L-BV605 (clone DREG-56),
CD4-BUV395 (clone SK3), CD8-BUV805 (clone SK1). 




Cytokine production 

Approximately 1 × 10° CAR⁺ T cells and 1 × 10° tumour cells were cultured
in 200 µl CM in 96-well flat bottom plates for 24 h. For idiotype
stimulation, serial dilutions of 1A7 were crosslinked in 1x Coating Buffer
(BioLegend) overnight at 4 °C on Nunc Maxisorp 96-well ELISA plates
(Thermo Scientific). Wells were washed once with PBS and 1 × 10° CAR⁺
T cells were plated in 200 µl CM and cultured for 24 h. Triplicate wells
were plated for each condition. Culture supernatants were collected
and analysed for IFNγ and IL-2 by ELISA (BioLegend).


Intracellular cytokine staining 

For intracellular cytokine staining analysis, CAR⁺ T cells and target cells
were plated at 1:1 effector:target ratio in CM containing 1x monensin
(eBioscience) and 5 µl per test CD107a antibody (BV605, Clone H4A3,
BioLegend) for 5-6 h. After incubation, intracellular cytokine staining
was performed using the FoxP3 TF Staining Buffer Set (eBioscience)
according to the manufacturer’s instruction using the following antibodies
from BioLegend: IL2-PECy7 Clone MQ1-17H12, IFNγ-APC/Cy7
Clone 4S.B3, and TNFα-BV711 Clone Mab11.


Incucyte lysis assay 

Approximately 5 × 10* GFP⁺ leukaemia cells were co-cultured with
CAR T cells in 200 µl CM in 96-well flat bottom plates for up to 120 h.
Triplicate wells were plated for each condition. Plates were imaged
every 2-3 h using the IncuCyte ZOOM Live-Cell analysis system (Essen
Bioscience). Four images per well at 10x zoom were collected at each
time point. Total integrated GFP intensity per well was assessed as a
quantitative measure of live, GFP⁺ tumour cells. Values were normalized
to the starting measurement and plotted over time. Effector:target
ratios are indicated in the figure legends.
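
The normalization step described above can be illustrated with a minimal sketch. It assumes that each well is normalized to its own first time point before replicate wells are averaged; the text does not state whether normalization was done per well or on the averaged signal, so that ordering, the function names and the example numbers are all assumptions.

```python
from typing import List


def normalize_to_start(intensities: List[float]) -> List[float]:
    """Divide every time point by the first (starting) measurement."""
    if not intensities:
        raise ValueError("need at least one measurement")
    start = intensities[0]
    if start <= 0:
        raise ValueError("starting intensity must be positive")
    return [value / start for value in intensities]


def mean_normalized_curve(wells: List[List[float]]) -> List[float]:
    """Normalize each replicate well, then average across wells per time point."""
    normalized = [normalize_to_start(well) for well in wells]
    n_points = min(len(well) for well in normalized)
    return [
        sum(well[t] for well in normalized) / len(normalized)
        for t in range(n_points)
    ]


if __name__ == "__main__":
    # Made-up integrated GFP intensities for three replicate wells.
    triplicate = [
        [100.0, 150.0, 50.0],
        [200.0, 300.0, 100.0],
        [50.0, 75.0, 25.0],
    ]
    print(mean_normalized_curve(triplicate))
```

A normalized value above 1 indicates tumour outgrowth relative to the start of the co-culture, and a value below 1 indicates net loss of GFP signal.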


Immunoblotting and immunoprecipitations 

Whole-cell protein lysates were obtained in non-denaturing buffer
(150 mM NaCl, 50 mM Tris pH 8, 1% NP-40, 0.25% sodium deoxycholate).
Protein concentrations were estimated by Bio-Rad colorimetric
assay. Immunoblotting was performed by loading 20 µg of protein
onto 11% PAGE gels followed by transfer to PVDF membranes. Signals
were detected by enhanced chemiluminescence (Pierce) or with the
Odyssey imaging system. Representative blots are shown. The following
primary antibodies were purchased from Cell Signaling: c-Jun
(60A8), P-c-Jun(Ser73) (D47G9), JunB (C37F9), BATF (D7C5), IRF4 (4964) and
Histone-3 (1B1B2). The BATF3 (AF7437) antibody was from R&D.
Immunoprecipitations were performed in 100 µg of whole-cell protein lysates
in 150 µl of nondenaturing buffer and 7.5 µg of agar-conjugated antibodies
c-Jun (G4) or JunB (C11) (Santa Cruz Biotechnology). After overnight
incubation at 4 °C, beads were washed three times with nondenaturing
buffer, and proteins were eluted in Laemmli sample buffer, boiled and
loaded onto PAGE gels. Detection of immunoprecipitated proteins was
performed with above-mentioned reagents and antibodies.


Preparation of chromatin fractions 

Separation of chromatin-bound from soluble proteins was performed 
as previously described using cytoskeletal (CSK) buffer: 10 mM PIPES-KOH
(pH 6.8), 100 mM NaCl, 300 mM sucrose, 3 mM MgCl₂, 0.5 mM
PMSF, 0.1 mM glycerolphosphate, 50 mM NaF, 1 mM Na₃VO₄, containing
0.1% Nonidet P-40 and protease inhibitors 2 mM PMSF, 10 µg ml⁻¹ leupeptin,
4 µg ml⁻¹ aprotinin, and 4 µg ml⁻¹ pepstatin. In brief, cell pellets
were lysed for 10 min on ice followed by 5,000 rpm centrifugation at
4 °C for 5 min. The soluble fraction was collected and cleared by high-speed
centrifugation, 13,000 rpm for 5 min. Protein concentration was
determined by Bradford assays. Pellets containing chromatin-bound
proteins were washed with CSK buffer and centrifuged at 5,000 rpm
at 4 °C for 5 min. Chromatin-bound proteins were solubilized in 1x Laemmli
Sample Buffer and boiled for 5 min. Equal volumes of chromatin


and soluble fraction were loaded for each sample and analysed by 
immunoblotting. 


ChIP and library preparation 

Twenty million CAR T cells were fixed with 1% formaldehyde for 10 min 
at room temperature. Cross-linking was quenched using 0.125 M glycine 
for 10 min before cells were washed twice with PBS. Cross-linked pellets 
were frozen with dry-ice ethanol and stored at −80 °C. Two biological 
replicates were collected for each cell culture. Chromatin immuno- 
precipitations were performed with exogenous spike-ins (ChIP-Rx) 
to allow for proper normalization, as previously described”. In brief, 
pellets were thawed on ice before cell membrane lysis in 5 ml LB1 by 
rotating for 10 min at 4 °C. Nuclei were pelleted at 1,350g for 5 min at 
4 °C and lysed in 5 ml LB2 by rotating for 10 min at room temperature. 
Chromatin was pelleted at 1,350g for 5 min at 4 °C and resuspended in 
1.5 ml LB3. Sonication was performed in a Bioruptor Plus until chromatin 
was 200–700 bp. Debris was pelleted, supernatants were 
collected and Triton X-100 was added to 1% final concentration. Ten per 
cent of the sample was collected as input controls. Anti-IRF4 (Abcam 
Ab101168) or anti-c-Jun (Active Motif 39309) targeting antibodies were 
added at 5 μg per immunoprecipitate to sonicated lysate and rotated 
at 4 °C for 16-20 h. 

Protein G Dynabeads (100 μl per immunoprecipitate) were washed 
three times with Block Solution (0.5% BSA in PBS). Antibody-bound 
chromatin was added to beads and rotated for 2–4 h at 4 °C. Bead-bound 
chromatin was washed five times with 1 ml RIPA wash buffer then once 
with 1 ml TE buffer with 500 mM NaCl. Beads were resuspended in 210 
μl elution buffer and chromatin was eluted at 65 °C for 15 min. Beads were 
magnetized and supernatant was removed to a fresh tube. Immuno- 
precipitated and input control chromatin was reverse cross-linked at 
65 °C for 12-16 h. 

Samples were diluted with 1 volume TE buffer. RNA was digested using 
0.2 mg ml⁻¹ RNase A (Qiagen 19101) for 2 h at 37 °C. CaCl₂ was added to 
5.25 mM and samples were treated with 0.2 mg ml⁻¹ proteinase K (Life 
Technologies E00491) for 30 min at 55 °C. One volume phenol-chloroform-isoamyl 
alcohol was added and centrifuged at 16,500g for 5 min to 
extract DNA, followed by a second extraction using one volume pure 
chloroform. Aqueous phase was removed and DNA was precipitated 
using two volumes ethanol and 0.3 M sodium acetate. DNA pellets were 
resuspended in EB elution buffer (Qiagen). 

To prepare libraries for sequencing, DNA was end repaired using T4 
polymerase (New England Biolabs M0203L), Klenow fragment (NEB 
M0210L), and T4 polynucleotide kinase (NEB M0201L) for 30 min at 
20 °C. 3′ A-tailing was performed using Exo- Klenow fragment (NEB 
M0212L) for 30 min at 37 °C. Illumina TruSeq Pre-Indexed Adaptors 
(1 μM) or NEBNext Illumina Multiplex Oligo Adaptors (NEB E7335S) 
were ligated for 1 h at room temperature. Unligated adapters were 
separated by gel electrophoresis (2.5% agarose, 0.5x TBE) and ligated 
DNA was purified using a NucleoSpin Gel Clean-up Kit (Macherey-Nagel 
740609.250). Ligated DNA was PCR amplified using TruSeq Primers 1.0 
and 2.0 or NEBNext Multiplex Primers and purified using AMPure XP 
beads (Beckman Coulter A63881). Purified libraries were quantified 
using Agilent 2100 Bioanalyzer HS DNA and multiplexed in equimolar 
concentrations. Sequencing was performed using an Illumina NextSeq 
or HiSeq at 2 x 75 bp by Stanford Functional Genomics Facility. 


Mice 

Immunocompromised NOD-SCID-Il2rg−/− (NSG) mice were purchased 
from JAX and bred in-house. All mice were bred, housed and treated 
in ethical compliance with Stanford University IACUC (APLAC) 
approved protocols. Six-to-eight-week-old male or female mice were 
inoculated with either 1 x 10° Nalm6-GL leukaemia via intravenous or 
0.5 x 10°-1 x 10° 143B osteosarcoma via intramuscular injections. All 
CAR T cells were injected intravenously. Time and treatment dose are 
indicated in the figure legends. Leukaemia progression was measured 
by bioluminescent imaging using the IVIS imaging system. Values were 
analysed using Living Image software. Solid tumour progression was 
followed using caliper measurements of the injected leg area. Mice were 
humanely euthanized when an IACUC-approved end-point measure- 
ment reached 1.75 cm in either direction (for solid tumour) or when 
mice demonstrated signs of morbidity and/or hind-limb paralysis 
(leukaemia). Five-to-ten mice per group were treated in each experi- 
ment based on previous experience in these models, and each 
experiment was repeated two or three times as indicated. Mice were 
randomized to ensure equal pre-treatment tumour burden before 
CAR T cell treatment. In some experiments, researchers were blinded 
to treatment during tumour measurement. 


Blood and tissue analysis 

Peripheral blood sampling was conducted via retro-orbital blood col- 
lection under isoflurane anaesthesia at the indicated time points. Fifty 
microlitres of blood was labelled with CD45, CD3, CD4 and CD8, lysed 
using BD FACS Lysing Solution and quantified using CountBright Absolute 
Counting beads (Thermo Fisher) on a BD Fortessa flow cytometer. 
For ex vivo analysis of CAR TILs, 14 days after T cell treatment (day 
28 after tumour engraftment), six mice per group were euthanized, 
solid tumour tissue was collected, mechanically dissociated using the 
gentleMACS dissociator (Miltenyi), and single-cell suspensions were 
either analysed by flow cytometry, re-plated with 3 x 10° Nalm6-Her2+ 
target cells for ICS analysis, or labelled for sorting. Live, CD45+ TILs 
were sorted from each tumour and re-stimulated at 1:1 effector:target 
ratio with Nalm6-Her2+ target cells. Twenty-four-hour supernatant was 
analysed for IL-2 production by ELISA. 


ATAC-seq 

ATAC-seq library preparation was carried out as previously described”. 
In brief, 100,000 cells from each sample were sorted by FACS into CM, 
centrifuged at 500g at 4 °C, then resuspended in ATAC-seq resuspension 
buffer (RSB) (10 mM Tris-HCl, 10 mM NaCl, 3 mM MgCl₂) supplemented 
with 0.1% NP-40, 0.1% Tween-20 and 0.01% digitonin. Samples 
were split into two replicates each before all subsequent steps. Samples 
were incubated on ice for 3 min, then washed out with 1 ml RSB supplemented 
with 0.1% Tween-20. Nuclei were pelleted at 500g for 10 
min at 4 °C. The nuclei pellet was resuspended in 50 μl transposition 
mix (25 μl 2× TD buffer, 2.5 μl transposase (Illumina), 16.5 μl PBS, 0.5 
μl 1% digitonin, 0.5 μl 10% Tween-20, 5 μl H₂O) and incubated at 37 °C 
for 30 min in a thermomixer with 1,000 rpm shaking. The reaction was 
cleaned up using the Qiagen MinElute PCR Purification Kit. Libraries 
were PCR-amplified using the NEBNext Hi-Fidelity PCR Master Mix 
and custom primers (IDT) as previously described”. Libraries were 
sufficiently amplified following 5 cycles of PCR, as indicated by qPCR 
fluorescence curves”. Libraries were purified with the Qiagen MinElute 
PCR Purification Kit and quantified with the KAPA Library Quantifica- 
tion Kit. Libraries were sequenced on the Illumina NextSeq at the Stan- 
ford Functional Genomics Facility with paired-end 75-bp reads. Adaptor 
sequences were trimmed using SeqPurge and aligned to hg19 genome 
using bowtie2. These reads were then filtered for mitochondrial reads, 
low mapping quality (Q > 20), and PCR duplicates using Picard tools. 
Then we converted the bam to a bed and got the Tn5-corrected insertion 
sites ('+' stranded +4 bp, '−' stranded −5 bp). To identify peaks, we called 
peaks for each sample using MACS2 '--shift -75 --extsize 150 --nomodel 
--call-summits --nolambda --keep-dup all -p 0.00001' using the insertion 
beds. To get a union peak set, we (1) extended all summits to 500 bp; (2) 
merged all summit bed files; and (3) used bedtools cluster and selected 
the summit with the highest MACS2 score. This was then filtered by 
the ENCODE hg19 blacklist (https://www.encodeproject.org/annotations/ENCSR636HFF/), 
to remove peaks that extend beyond the ends 
of chromosomes. We then annotated these peaks using HOMER and 
computed the occurrence of a transcription factor motif using motifmatchr 
in R with chromVARMotifs HOMER set. To create sequencing 
tracks, we read the Tn5-corrected insertion sites into R and created a 
coverage pileup binned every 100 bp using rtracklayer. We then counted 
all insertions that fell within each peak to get a counts matrix (peaks × 
samples). To determine differential peaks we first used peaks that 
were annotated as ‘TSS’ as control genes or ‘housekeeping peaks’ for 
DESeq2 and then computed differential peaks with this normaliza- 
tion. All clustering was performed using the regularized log transform 
values from DESeq2. Transcription factor motif deviation analysis was 
carried out using chromVAR as previously described”. Transcription 
factor motif enrichment was calculated using a hypergeometric test 
in R, testing the representation of a motif (from motifmatchr above) in 
a subset of peaks vs all peaks. 
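
Two of the computational steps described above, the Tn5 offset correction and the hypergeometric motif-enrichment test, can be sketched in Python as follows. This is an illustrative sketch and not the analysis code used in this study (the text states that the enrichment test was run in R). The function names, the 0-based half-open BED coordinate convention and the one-sided upper-tail form of the test are assumptions.

```python
from math import comb


def tn5_insertion_site(start, end, strand):
    """Return the offset-corrected Tn5 insertion coordinate for one aligned read.

    Assumption: 0-based, half-open BED intervals. '+' reads are shifted +4 bp
    from their 5' end (start); '-' reads are shifted -5 bp from their 5' end (end).
    """
    if strand == "+":
        return start + 4
    if strand == "-":
        return end - 5
    raise ValueError(f"unknown strand: {strand!r}")


def hypergeom_enrichment_p(n_all, n_all_with_motif, n_subset, n_subset_with_motif):
    """Upper-tail hypergeometric P value, P(X >= n_subset_with_motif).

    n_all: number of peaks in the full set
    n_all_with_motif: peaks in the full set that contain the motif
    n_subset: number of peaks in the subset being tested
    n_subset_with_motif: peaks in the subset that contain the motif
    """
    max_hits = min(n_all_with_motif, n_subset)
    total = comb(n_all, n_subset)
    # Sum the probability mass for every outcome at least as extreme as observed.
    tail = sum(
        comb(n_all_with_motif, k) * comb(n_all - n_all_with_motif, n_subset - k)
        for k in range(n_subset_with_motif, max_hits + 1)
    )
    return tail / total
```

For example, with 10 peaks of which 4 contain a motif, a subset of 3 peaks in which 2 contain the motif gives `hypergeom_enrichment_p(10, 4, 3, 2)`, which is 40/120.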


Subset RNA-seq 

For T cell subset-specific RNA-seq, T cells were isolated from healthy 
donor buffy coats as described above. Before activation, naive and 
central memory CD4+ or CD8+ subsets were isolated using a BD FACSAria 
cell sorter (Stem Cell FACS Core, Stanford University School of Medicine) 
using the following markers: naive (CD45RA+CD45RO−, CD62L+, 
CCR7+, CD95−, and CD122−), central memory (CD45RA−CD45RO+, 
CD62L+, CCR7+). Sorted starting populations were activated, transduced 
and cultured as described above. On days 7, 10 and 14 of culture, 
CAR+ CD4+ and CD8+ cells were sorted, and RNA was isolated using 
Qiagen mRNEasy kit. Samples were library prepped and sequenced 
via Illumina NextSeq paired end platform by the Stanford Functional 
Genomics Core. 


Bulk RNA-seq 

For bulk RNA isolation, healthy donor T cells were prepared as 
described. On day 10 or 11 of culture, total mRNA was isolated from 2 
x 10° bulk CAR T cells using Qiagen RNEasy Plus mini isolation kit. Bulk 
RNA-seq was performed by BGI America (Cambridge, MA) using the 
BGISEQ-500 platform, single-end 50-bp read length, at 30 x 10° reads 
per sample. Principal component analysis was performed using stats 
package and plots with ggplot2 package in R (version 3.5)”. GSEA was 
performed using the GSEA software (Broad Institute) as described**™. 
DAVID analysis was performed for transcription factor enrichment as 
described****, 


Single-cell RNA-seq 

To compare gene expression in single CD19-28z and GD2-28z CAR 
cells, we sorted the naive T cell subset on day 0 for subsequent single-cell 
analysis on day 10 using the Chromium platform (10X Genomics) and 
the Chromium Single Cell 3′ v2 Reagent Kit according to the manufacturer’s 
instructions. cDNA libraries were prepared separately for 
CD19-CAR and GD2-CAR cells, and the CD4+ cells and CD8+ cells were 
combined in each run to be separated bioinformatically downstream. 
Sequencing was performed on the Illumina NextSeq system (paired- 
end, 26 bp into read 1 and 98 bp into read 2) to a depth of more than 
100,000 reads per cell. Single-cell RNA-seq reads were aligned to the 
Genome Reference Consortium Human Build 38 (GRCh38), normal- 
ized for batch effects, and filtered for cell events using the Cell Ranger 
software (10X Genomics). A total of 804 CD19-CAR and 726 GD2-CAR 
T cells were sequenced to an average of 350,587 post-normalization 
reads per cell. The cell-gene matrix was further processed using the 
Cell Ranger R Kit software (10X Genomics) as previously described*. In 
brief, we first selected genes with at least one unique molecular identifier 
(UMI) count in any given cell. UMI counts were then normalized 
to UMI sums for each cell and multiplied by a median UMI count across 
cells. Next, the data were transformed by taking a natural logarithm 
of the resulting data matrix. For the correlation network of exhaustion-related 
transcription factors, transcription factor genes identified as 
differentially expressed (P < 0.05) by DESeq2 form the nodes of the 
network. Colours represent log₂-transformed fold change (GD2 vs 
CD19 CAR). Edge thickness represents the magnitude of correlation in 
expression between the relevant pair of genes across cells. Correlation 
score greater than 0.1 was used to construct networks. 

To compare gene expression in single JUN-overexpressing and control 
Her2-BBz CAR T cells in vivo, live human CD45+ tumour-infiltrating 
cells were sorted and pooled from six NSG mice bearing 143B osteosarcoma 
tumours 14 days after CAR T cell infusion. Sorted cells were 
analysed using the 10X Genomics platform as described above and 
sequenced on the Illumina HiSeq 4000 system to a depth of more 
than 50,000 reads per cell. A total of 6,946 Her2-BBz and 10,985 JUN- 
Her2-BBz cells were sequenced to an average of 49,542 post-normali- 
zation reads per cell. The cell-gene matrix was further processed using 
the Seurat v.3.0 software™*~”. In brief, we selected genes expressed in >50 
cells. Single live cells were selected as droplets expressing >500 genes 
with <20,000 UMI counts and <10% mitochondrial reads. UMI count 
data matrix was transformed and scaled, including variable feature 
selection, with the SCTransform pipeline. T cells were selected as CD3+ 
events (99.3% cells expressing CD3G, CD3D, CD3E, and/or CD247 gene). 
Where indicated, CD4+ and CD8+ T cell subsets were selected (8.3% CD4+CD8−, 
70.3% CD4−CD8+). The resulting data matrix was then examined 
using differential expression analysis, cell cycle analysis, clustering, 
and UMAP embedding. 
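
The single-cell quality thresholds stated above (>500 genes, <20,000 UMI counts and <10% mitochondrial reads) can be expressed as a simple filter. The Python sketch below only illustrates the thresholds and is not the Seurat code used in the study. The `Droplet` record and function names are hypothetical, and the mitochondrial percentage is assumed to be computed as a fraction of UMI counts.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Droplet:
    barcode: str
    n_genes: int       # number of genes detected in the droplet
    n_umis: int        # total UMI counts in the droplet
    n_mito_umis: int   # UMI counts assigned to mitochondrial genes


def passes_qc(droplet, min_genes=500, max_umis=20_000, max_mito_fraction=0.10):
    """Return True if the droplet meets all three thresholds from the text."""
    if droplet.n_umis <= 0:
        return False
    mito_fraction = droplet.n_mito_umis / droplet.n_umis
    return (
        droplet.n_genes > min_genes
        and droplet.n_umis < max_umis
        and mito_fraction < max_mito_fraction
    )


def filter_droplets(droplets):
    """Keep only droplets that pass the quality thresholds."""
    return [d for d in droplets if passes_qc(d)]
```

The upper bound on UMI counts is the kind of cut-off commonly used to exclude likely multiplets, and the mitochondrial bound to exclude damaged cells; the text does not state the rationale, so this reading is an assumption.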


Statistical analysis 

Unless otherwise noted, statistical analyses for significant differences 
between groups were conducted using unpaired two-tailed t-tests 
without correction for multiple comparisons and without assuming 
consistent s.d. using GraphPad Prism 7. Survival curves were compared 
using the log-rank Mantel-Cox test. See Supplementary Table 2 for 
full statistical analyses, including exact P values, t-ratio, and degrees 
of freedom. 
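
An unpaired t-test that does not assume equal standard deviations is commonly computed as Welch's test. The Python sketch below shows how the t-ratio and degrees of freedom of such a test are obtained; it is an illustration only, since the analyses in this study were run in GraphPad Prism 7, and identifying the Prism option with Welch's correction is an assumption. The function name is hypothetical, and the P value itself is not computed here because it requires the t-distribution, which the Python standard library does not provide.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Return (t-ratio, degrees of freedom) for Welch's unequal-variance t-test.

    a, b: sequences of measurements from two independent groups,
    each with at least two values.
    """
    a = [float(x) for x in a]
    b = [float(x) for x in b]
    if len(a) < 2 or len(b) < 2:
        raise ValueError("each group needs at least two values")
    # Squared standard errors of the two group means.
    se2_a = variance(a) / len(a)
    se2_b = variance(b) / len(b)
    if se2_a + se2_b == 0:
        raise ValueError("both groups have zero variance")
    t = (mean(a) - mean(b)) / sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation for the degrees of freedom.
    df = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (len(a) - 1) + se2_b ** 2 / (len(b) - 1)
    )
    return t, df
```

For triplicate wells such as `welch_t([1, 2, 3], [2, 4, 6])`, the degrees of freedom come out as 50/17 rather than the 4 that a pooled-variance test would use.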


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
Extended Data Fig. 1 | High-affinity 14g2a-GD2(E101K) CAR T cells manifest 
an exaggerated exhaustion signature compared with the original 14g2a-GD2 
CAR T cells. a, Surface inhibitory receptor expression in CD19, GD2 and 
HA-GD2(E101K) CAR T cells at day 10 of culture. High-affinity E101K mutation 
results in increased inhibitory receptor expression in CD4+ and CD8+ CAR 
T cells, compared with parental GD2 CAR T cells. b, IL-2 secretion after 24-h 
co-culture of HA-GD2(E101K) or original GD2-28z CAR T cells with GD2+ target 
cells. The increased exhaustion profile of HA-GD2(E101K) CAR T cells 
corresponds to decreased functional activity, as measured by the ability to 
produce IL-2 after stimulation. Data are mean ± s.d. of triplicate wells; 
representative of four independent experiments. c, PCA of bulk RNA-seq 
demonstrates larger variance between HA-GD2(E101K) and CD19 CAR T cells, 
whereas GD2-28z (short hinge) CAR T cells are intermediary. Left, CD4+ T cells. 
Right, CD8+ CAR T cells, naive-derived. Number of replicates is indicated. 
d, e, HA-GD2(E101K) CAR expression causes enhanced inhibitory receptor 
expression (d) and decreased memory formation (e) in CD4+ CAR T cells. 
(See Fig. 1 for CD8+ data). f, IL-2 secretion from control CD19-28z CAR T cells or 
CD19 CAR T cells with bi-cistronic expression of tonically signalling HA-GD2(E101K) 
(19-HA-28z, blue) or bi-cistronic expression of Her2-28z (19-Her2-28z, 
grey) after 24-h stimulation with Nalm6 (CD19+GD2−Her2−) target cells to 
demonstrate that co-expression of HA-GD2-28z CAR induces T cell dysfunction 
in CD19-28z CAR T cells. Data are mean ± s.d. of triplicate wells; representative 
of three independent experiments. g, RNA-seq PCA from Fig. 1 showing PC2 
separation is driven by central memory versus naive starting subset and PC3 
separation driven by CD4 versus CD8. h, Gene set enrichment analysis (GSEA): 
gene sets upregulated in day-10 HA-28z CAR T cells versus CD19-28z CAR T cells 
showed significant overlap with genes upregulated in exhausted versus 
memory CD8+ cells (left), exhausted versus effector CD8+ cells (middle), and 
exhausted versus naive CD8+ cells (right) in a mouse model of chronic viral 
infection*. NES, normalized enrichment score. In b and f, P values determined 
by unpaired two-tailed t-tests. 
driving the separation of CD19 and HA-28z CAR T cells in bulk RNA-seq (yellow, 
Fig. 1f). In total, 79 out of the top 200 genes from bulk RNA-seq are differentially 
expressed by DESeq2 analysis in GD2-28z versus CD19-28z single cells. 
Highlighted genes from the intersection include inhibitory receptors (CTLA4, 
LAG3, GITR), effector molecules (CD25, IFNG, GZMB), cytokines (IL13 and IL1A) 

plots depicting individual gene expression in CD8+ GD2-28z and CD19-28z 
single CAR T cells. Genes upregulated in GD2 CAR T cells include inhibitory 
receptors, effector molecules and AP-1 family transcription factors, whereas 
CD19 CAR T cells have increased expression of memory-associated genes. P 
values determined by unpaired two-tailed Wilcoxon-Mann-Whitney U test. 
Extended Data Fig. 3 | ATAC-seq data quality control. a, b, Insert length (a) 
d, Location of mapped peaks in each sample by total number of peaks (top) and 
Extended Data Fig. 4 | AP-1 family transcription factors in HA-28z exhausted 
CAR T cells. a, Differentially accessible chromatin regions in CD4+ CD19-28z 
and HA-28z CAR T cells. Both the naive- and central-memory-derived subsets 
are incorporated for each CAR. b, PCA from Fig. 1h showing PC2 separation is 
driven by central memory versus naive, and PC3 separation driven by CD4 
versus CD8. c, Top transcription factor motifs enriched in chromatin regions 
differentially accessible in HA-28z CAR T cells comprise AP-1-bZIP family 
factors in all starting T cell subsets. CD8+ naive subset is shown in Fig. 2. d, Peak 
clustering by shared regulatory motif (left) and enrichment heat map of 
transcription factor motifs (right) in each cluster. Ten different clusters 
including clusters associated with exhausted (EX1–EX4) or healthy (HLT1–HLT2) 
CAR T cells, central memory or naive starting subset, and CD4 or CD8 
T cell subset. Genes of interest in each cluster are highlighted to the right. 
e, Bulk RNA-seq expression (fragments per kilobase of exon model per million 
reads mapped, FPKM) of indicated AP-1-bZIP and IRF family members in CD19-28z 
(black) and HA-28z (blue) CAR T cells. Data are mean ± s.e.m. from n = 6 
samples across three donors showing paired CD19 versus HA expression for 
each gene. *P < 0.05, **P < 0.01, ***P < 0.001, Wilcoxon matched-pairs signed 
rank test (see Supplementary Information for exact P values). f, Increased 
protein expression of c-Jun, JunB, BATF3 and IRF4 in HA-28z versus CD19-28z 
CAR T cells at day 10 of culture by immunoblotting. Densitometry 
measurements of the fold change in HA compared with CD19 (n = 2–5 
experiments). P values determined by unpaired two-tailed t-tests. 
g, Correlation network of exhaustion-related transcription factors in 
naive-derived CD8+ GD2-28z CAR T cells using single-cell RNA-seq analysis. 
h, Violin plots depicting single-cell gene expression of FOS, JUN, BATF and IRF4 
in CD8+ clusters associated with response (CD8.G) and non-response (CD8.B) in 
patients with metastatic melanoma after checkpoint therapy (CD8T-post-CD8G.B)”’. 
P values determined by unpaired two-tailed Wilcoxon-Mann-Whitney U test. 
In i–m, AP-1-modified HA-28z CAR T cells exhibit enhanced 
functional activity. i–k, CAR T cells were co-transduced with (AP-1) or without 
(control) a lentiviral vector encoding AP-1 transcription factors Fos and c-Jun 
and a truncated NGFR (tNGFR) surface selection marker. i, Schematic of the 
lentiviral construct. j, Representative transduction efficiency of AP-1-modified 
CAR T cells as measured by NGFR surface expression in indicated CD4+ and 
CD8+ CAR T cells. k, IL-2 production in control (black) or AP-1-modified (red) 
CAR T cells after 24-h stimulation with 143B-CD19 target cells. AP-1-modified 
HA-28z CAR T cells show increased IL-2 production compared with control CAR 
T cells. Data are mean ± s.d. of triplicate wells; representative of two 
independent experiments. l, m, CAR T cells were co-transduced with lentiviral 
vectors encoding either c-Fos or c-Jun and a tNGFR surface selection marker. 
l, Schematics of the c-Fos and c-Jun lentiviral constructs. m, IL-2 production in 
control (blue), Fos (green) or c-Jun (red) modified CAR T cells after 24-h 
stimulation with Nalm6-GD2 target cells. Data are mean ± s.d. of triplicate 
wells; representative of two independent experiments. Asterisk in i and l 
denotes a stop codon. P values determined by unpaired two-tailed t-tests. ns, 
not significant (P > 0.05). 

Extended Data Fig. 5 | Enhanced activity of JUN-modified CAR T cells. JUN 
CAR T cells were produced as in Fig. 3. a, c-Jun overexpression does not affect 
the CD4:CD8 ratio in HA-28z CAR T cells at day 10 of culture (n = 14 independent 
experiments). Lines indicate paired samples from the same donor. P values 
determined by paired, two-tailed t-tests. b, c, Fold increase in IL-2 (b) and IFNγ 
(c) release after 24-h co-culture with the indicated target cells in JUN versus 
control CD19 and HA-28z CAR T cells. Each dot represents one independent 
experiment from different donors of n = 8 total experiments. d, Representative 
contour plots demonstrating increased intracellular cytokine production in 
both CD4+ and CD8+ JUN-HA-28z versus control HA-28z CAR T cells stimulated 
for 5 h with Nalm6-GD2 target cells. Representative of three independent 
experiments. e, Left, flow cytometry showing representative CD45RA and 
CD62L expression in control versus JUN CAR CD4+ T cells at day 10. Right, 
relative frequency of effector (CD45RA+CD62L−), stem-cell memory 
(CD45RA+CD62L+), central memory (CD45RA−CD62L+), and effector memory 
(CD45RA−CD62L−) cells in CD4+ control or JUN-HA-28z CAR T cells (n = 6 donors 
from independent experiments). Lines indicate paired samples from the same 
donor. P values determined by paired two-tailed t-tests. f, Extended in vitro 
expansion of control (blue) or JUN-modified (red) CD19 CAR T cells in five 
independent experiments with five different healthy donors. At the indicated 
time points, T cells were re-plated in fresh T cell media with 100 IU ml⁻¹ IL-2. 
T cells were counted and fed to keep cells at 0.5 x 10° cells per ml every 2–3 days. 
For Donor-1, 5 x 10° viable T cells were re-plated on days 14 and 28. For Donor-2, 
5 x 10° viable T cells were re-plated on days 14, 28, 42 and 56. For Donor-3, 5 x 10° 
viable T cells were re-plated on days 10, 17, 24 and 31. For Donor-4 and Donor-5, 
5 x 10° viable T cells were re-plated on days 10, 17, 24 and 34. g, On day 42 of 
culture, 1 x 10° viable T cells from Donor-4 (top) and Donor-5 (bottom) were 
re-plated and cultured for 7 days with or without IL-2. h–j, Cell-surface phenotype 
of control or JUN-CD19-28z CAR T cells from Fig. 3g (Donor-3) on day 46 of 
culture. h, CD4 versus CD8 expression. i, Surface expression of CD45RA versus 
CD62L. j, Day 46 surface exhaustion marker expression in CD8+ T cells. 


[Extended Data Fig. 6 image. Recoverable panel labels: PC1 healthy; PC1 exhausted; genes down; DAVID; JUN-HA-28z, HA-28z, control; exhausted T cells; c-Jun rescue of T cell exhaustion; AP-1-i complex; disrupted AP-1-i complexes; activating AP-1 complex; soluble fraction; chromatin-bound fraction; c-Jun; P-c-Jun(S73); loading.]

Extended Data Fig. 6 | c-Jun overexpression mediates transcriptional but 
not epigenetic reprogramming of exhausted HA-28z CAR T cells. a, The 
log₂-transformed fold change in HA versus JUN-HA ATAC-seq demonstrating no 
significantly different peaks between conditions. b, Gene expression of 319 
genes differentially expressed in JUN versus HA-28z CAR T cells (log₂-transformed 
fold change > 2, Padj < 0.05). Genes downregulated in JUN CAR 
T cells (blue) include exhaustion-associated genes such as BATF3, GZMB, LAG3, 
JUNB and ENTPD1 (encoding CD39). Genes upregulated in JUN CAR T cells (red) 
include genes associated with naive and memory differentiation such as IL7R, 
LEF1, SELL (CD62L), CD44, and KLF3. c, Venn diagrams showing overlap of the 
319 genes differentially expressed in JUN versus HA-28z and the top 200 genes 
distinguishing exhausted (HA) and healthy (CD19) CAR T cells from PC1 in 
Fig. 1e, f. Genes downregulated in JUN CAR T cells overlap with 
exhaustion-associated (HA, PC1-exhausted) genes, and genes upregulated in JUN CAR 
T cells overlap with genes associated with healthy memory differentiation 
(CD19, PC1-healthy). d, DAVID bioinformatics analysis of transcription 
factor-binding sites within the 319 genes differentially expressed in JUN CAR T cells 
reveals that the top transcription factor binding motif belongs to the AP-1 
family (269 out of 319 genes). e, Proposed mechanisms of c-Jun-mediated 
rescue of T cell exhaustion. AP-1-i indicates an exhaustion-associated AP-1 
complex. f, Immunoblot of total c-Jun and phosphorylated c-Jun (p-c-Jun(Ser73)) in 
control, JUN-WT and JUN-AA HA-28z CAR T cells. g, Immunoblot analysis of 
c-Jun protein expression in control and indicated JUN-variant-expressing HA-28z 
CAR T cells in either soluble or chromatin-bound cellular lysate fractions. 
c-Jun variants with deletions in the C-terminal DNA binding and leucine zipper 
dimerization domains (basic, LeuZ and bZIP) cannot bind chromatin and do not 
rescue functional activity. h, Decrease in mRNA expression of BATF, BATF3 and 
JUNB in JUN HA-28z CAR T cells compared with HA-28z cells (n = 3 donors, 
normalized to CD19 mRNA). P values determined by ratio paired two-tailed 
t-test. See Supplementary Fig. 1 for gel source data. 
Extended Data Fig. 7 | c-Jun overexpression decreases chromatin binding 
and complexing of JunB-BATF-BATF3-AP-1 complexes. a, Immunoblot 
analysis for the indicated AP-1-bZIP and IRF family member proteins in control 
and JUN CD19-28z and HA-28z CAR T cells (day 10). b, Immunoblot analysis for 
the indicated AP-1-bZIP and IRF family member proteins in control and JUN HA-28z 
CAR T cells (day 10) in either soluble or chromatin-bound cellular lysate 
fractions. c, c-Jun overexpression decreases JunB-BATF and JunB-BATF3 
complexes by immunoprecipitation-immunoblot analysis. Input (left), 
immunoprecipitation for c-Jun (middle), or JunB (right) in control or JUN HA-28z 
CAR T cells. Levels of IRF4 protein and complexes with c-Jun are unchanged. 
d–h, ChIP-seq analysis for c-Jun and IRF4. d, Motif enrichment in IRF4-bound 
(left) or c-Jun-bound (right) loci. e, IRF4 signal genome-wide. Data shown for 
each transduction at all IRF4-bound sites. The x and y axes show log-transformed 
normalized count signal in control and JUN-overexpression cells, 
respectively. f, IRF4 and c-Jun ChIP-seq genome tracks in JUN or control HA-28z 
CAR T cells. c-Jun ChIP with reference exogenous genome (ChIP-Rx; top), with x 
axis representing genomic position and y axis representing reference-adjusted 
reads per million (RRPM). IRF4 ChIP (bottom), with x axis representing 
genomic position and y axis representing reads per million (RPM). Arrows 
indicate peaks with increased c-Jun binding in HA-28z JUN cells at IRF4-bound 
sites within genes previously described to be regulated by IRF4 or BATF (TCF7, 
HAVCR2 and HIF1A)’. g, Overexpressed c-Jun is bound to IRF4-occupied sites in 
the genome. Enrichment plot of c-Jun ChIP-Rx signal (left) or IRF4 ChIP-seq 
signal (right) in either JUN overexpression (red) or control (blue) HA-28z CAR 
T cells at all JUN-bound sites. The x axis shows distance from centre of JUN-bound 
site, and y axis shows average RRPM across replicates for c-Jun ChIP or 
average RPM across replicates for IRF4 ChIP. h, Venn diagram showing number 
of genes bound by IRF4 and/or c-Jun (n genes expressed/n genes bound). GSEA 
analysis with genes bound only by IRF4 (top) and genes bound by c-Jun and IRF4 
(bottom), comparing levels of expression in JUN versus control HA-28z CAR 
T cells (normalized P < 0.05, FDR < 25%). i, Immunoblot of indicated AP-1 and IRF 
protein in control or CRISPR-knockout HA-28z CAR T cells demonstrating 
productive knockout of target protein. j, IL-2 (top) and IFNγ (bottom) release in 
HA-28z CAR T cells with control or CRISPR-knockout of the indicated AP-1 or 
IRF4 gene after 24-h stimulation with Nalm6-GD2 or 143B target cells. Data are 
mean ± s.d. of triplicate wells; representative of six independent experiments. 
Fold change across all experiments in Fig. 4e. In k and l, NSG mice were 
inoculated with 1 x 10° Nalm6-GD2 leukaemia cells via intravenous injection. 
A stress-test dose of 1 x 10° mock, HA-28z control, JUN-WT, JUN-AA or JUN-ΔbZIP 
HA-28z CAR+ T cells was given intravenously on day 7. k, Tumour 
progression was monitored using bioluminescent imaging. l, JUN-WT and JUN-AA 
HA-28z CAR T cells enhanced long-term survival, and control and JUN-ΔbZIP 
HA-28z CAR T cells were almost non-functional compared with mock 
untransduced T cells at this dose. Data are mean ± s.e.m. of n = 5 mice per group. 
For gel source data see Supplementary Fig. 1. 
Extended Data Fig. 8 | Functional rescue of exhausted HA-28z CAR T cells
requires the presence of c-Jun during both chronic and acute T cell
stimulation. a, Schematic of the destabilization domain (DD)-regulated JUN
expression vector. b, Schematic of drug-induced stabilization of JUN-DD
expression. Yellow diamond denotes trimethoprim (TMP)-stabilizing
molecule. c, Kinetics of drug-induced c-Jun stability in JUN-DD CAR T cells as
assessed by immunoblot. At time 0, 10 μM TMP was either added to untreated
cells (ON) or washed out of previously treated cells (OFF). Cells were removed
from each condition at 1, 2, 4, 8, 24 and 48 h and prepared for immunoblot
analysis of c-Jun expression. The observed band corresponds to the size of
JUN-DD. d, Densitometry analysis was performed on the blots from c and
normalized to loading control. Expression was plotted against time and
first-order kinetics curves were fit to the data to determine t1/2 for OFF and ON
kinetics. e, Total c-Jun expression in control, JUN-WT and JUN-DD HA-28z CAR
T cells at day 10 by intracellular flow cytometry (left) and immunoblot (right).
f, IL-2 (left) and IFNγ (right) production in control (blue), JUN-WT (red) or
JUN-DD (OFF, green; ON, purple) modified HA-28z CAR T cells 24 h after stimulation
with Nalm6-GD2 or 143B target cells, or media alone (baseline) at day 10. In e
and f, OFF indicates without TMP, ON indicates T cells cultured in the presence
of 10 μM TMP from day 4 and during co-culture. In g and h, TMP was added
either during T cell expansion (starting at day 4) or only during co-culture with
tumour cells as indicated in g. For ON-to-OFF and OFF-to-ON conditions, TMP
was removed or added 18 h before co-culture to ensure complete c-Jun
degradation or stabilization, respectively, before antigen exposure. h, IL-2
expression in one representative donor (left, s.d. across triplicate wells) and
fold increase in IL-2 (s.e.m. of n = 6 independent experiments representing
three different donors, relative to OFF-OFF condition). P values determined by
unpaired two-tailed t-tests. For gel source data, see Supplementary Fig. 1.
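
The half-life determination in panel d can be illustrated with a minimal Python sketch. It assumes a simple exponential decay, E(t) = E0 · exp(-kt), fitted by least squares on ln(E), so that t1/2 = ln(2)/k. The function name and the example values are hypothetical and are not taken from the study; the fitting software and exact model used by the authors are not stated in the legend.

```python
import math

def fit_first_order_half_life(times_h, expression):
    """Fit E(t) = E0 * exp(-k * t) by linear least squares on ln(E).

    Returns (k, t_half), with k in 1/h and t_half in h.
    Assumes every expression value is strictly positive.
    """
    if any(e <= 0 for e in expression):
        raise ValueError("expression values must be positive to take logs")
    logs = [math.log(e) for e in expression]
    n = len(times_h)
    mean_t = sum(times_h) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_h)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_h, logs))
    k = -sxy / sxx  # slope of ln(E) against t is -k
    return k, math.log(2) / k

# Hypothetical normalized band intensities after drug washout (OFF kinetics),
# sampled at the time points listed in panel c.
times = [1, 2, 4, 8, 24, 48]
intensities = [math.exp(-0.1 * t) for t in times]  # synthetic, noise-free
k, t_half = fit_first_order_half_life(times, intensities)
print(f"k = {k:.3f} per h, t1/2 = {t_half:.2f} h")
```

For ON kinetics, under the same first-order assumption, the quantity 1 - E(t)/Emax decays exponentially, so the same function could be applied to those transformed values. Real densitometry data are noisy near the detection floor, where a log-linear fit becomes unreliable.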
Extended Data Fig. 9 | c-Jun overexpression enhances CD19-BBz CAR T cell
activity under suboptimal antigen stimulation and Her2- or GD2-BBz CAR
T cell function in solid tumours. a, CD19 surface expression on parental
Nalm6-WT (green), CD19-knockout Nalm6 (19KO, black), and two different
CD19-low Nalm6 clones (F and Z): Nalm6-19KO plus CD19^low-F (F; blue), and
Nalm6-19KO plus CD19^low-Z (red). b, IL-2 (left) and IFNγ (right) release after
co-culture of control (blue) or JUN (red) CD19-BBz CAR T cells exposed to
Nalm6-WT and Nalm6-19^low clones F and Z. c, JUN versus control CD19-BBz CAR T cell
lysis of GFP+ Nalm6-WT (top), Nalm6-F (middle) or Nalm6-Z (bottom) target
cells at a 1:2 effector:target cell ratio, demonstrating enhanced activity of JUN
CAR T cells at low antigen density. Data in b and c denote mean ± s.d. of
triplicate wells; representative of four independent experiments. In d-f, NSG
mice were inoculated with 1 × 10^6 Nalm6-19^low clone F leukaemia cells. On day 1,
3 × 10^6 control or JUN CD19-BBz CAR+ T cells or 3 × 10^6 mock-transduced T cells
were transferred intravenously. d, Tumour growth was monitored by
bioluminescent imaging. e, JUN expression significantly improved the
long-term survival of CAR-treated mice. f, Mice receiving JUN-CD19-BBz CAR T cells
display increased peripheral blood T cells on day 20. Data in d-f denote
mean ± s.e.m. of n = 5 mice per group; representative of three independent
experiments. Long-term tumour-free survival is impeded in this model owing
to outgrowth of CD19-negative disease. In g-i, NSG mice were inoculated with
1 × 10^6 143B osteosarcoma cells via intramuscular injection; then, 1 × 10^7 mock,
Her2-BBz or JUN-Her2-BBz CAR T cells were given intravenously on day 7.
g, Tumour growth was monitored by caliper measurements. h, Long-term
survival. i, On day 20 after tumour implantation, peripheral blood T cells were
quantified in mice treated as in g. Data are mean ± s.e.m. of n = 5 mice per group;
representative of two independent experiments. j, Vector schematic of
JUN-GD2-BBz retroviral vector construct. k, IL-2 (left) and IFNγ (right) production in
JUN-modified (red) or control (blue) GD2-BBz CAR T cells after 24 h stimulation
with Nalm6-GD2 or 143B target cells. l, GD2-BBz CAR T cell lysis of GFP+
Nalm6-GD2 target cells at 1:1 (left) or 1:4 (right) effector:target cell ratio. Data in k and l
denote mean ± s.d. of triplicate wells; representative of four independent
experiments. In m and n, NSG mice were inoculated with 0.5 × 10^6 143B-19
osteosarcoma cells via intramuscular injection; then, 1 × 10^7 mock, GD2-BBz or
JUN-GD2-BBz CAR T cells were given intravenously on day 3. m, Tumour growth
was monitored by caliper measurements. n, Peripheral blood CD4+ (left) or
CD8+ (right) T cell counts at day 14 after tumour engraftment. Data are
mean ± s.e.m. of n = 5 mice per group; representative of two independent
experiments although early deaths (unrelated to tumour size) precluded
survival curves in both models. P values in n were determined by a
Mann-Whitney test. All other P values determined by unpaired two-tailed t-tests.
Survival curves were compared using the log-rank Mantel-Cox test.
Extended Data Fig. 10 | c-Jun-overexpressing CAR TILs demonstrate
increased activity in osteosarcoma xenograft tumours. Experimental design
described in Fig. 6. a-c, Frequency (a), phenotype (b) and ex vivo functional
activity (c) of CD4+ TILs from mice treated with Her2-BBz or JUN-Her2-BBz CAR
T cells. a, Left, CD4+ levels as a proportion of total live tumour cells. Right, CAR+
as a frequency of total live CD4+. b, Percentage of PD-1+ (left) and PD-1 mean
fluorescence intensity (MFI; middle) of total live CD4+ with representative flow
histograms (right). Mock untransduced T cells were from spleens of
tumour-bearing mice at the same time point. c, Frequency of indicated cytokine- or
CD107a-producing cells after 5-h re-stimulation with Nalm6-Her2+ target cells.
Gated on total, live CD4+ T cells (left) with representative contour plots (right).
Data are mean ± s.e.m. of n = 6 mice per group. Each data point represents an
individual mouse. P values determined by unpaired two-tailed t-tests. In d-g,
dissociated tumour cell suspensions were labelled and sorted by FACS analysis
to isolate live, human CD45+ TILs. Sorted cells from six mice per group were
pooled and approximately 10,000 cells were processed for 3′ single-cell
RNA-seq on the 10X Genomics platform. d, Volcano plot showing results of
differential expression analysis comparing JUN-Her2-BBz CAR T cells with
control Her2-BBz CAR T cells. Top three upregulated and downregulated genes
are highlighted. e, Heat map of the top 20 most significantly upregulated and
downregulated genes. f, Uniform manifold approximation and projection
(UMAP) embedding analysis showing JUN-Her2-BBz and control Her2-BBz CAR
T cells overlaid. g, Expression of indicated transcripts in JUN-Her2-BBz or
control Her2-BBz CAR T cells showing localization of CD4+ and CD8+ subsets,
activation marker (IL2RA), exhaustion marker (NR4A2), and maintenance of a
small memory-like population (IL7RA, KLF2) in JUN-overexpressing Her2-BBz
CAR T cells within the solid osteosarcoma tumour microenvironment.
bioluminescent imaging acquisition. Image Studio Software ver 3.1 (LI-COR) for image acquisition of western blots. Incucyte ZOOM
ver 2016B (Essen BioScience) for Incucyte image acquisition (killing assay).
Data analysis FlowJo ver 10.4.2 for analysis of flow cytometry. GSEA ver 2.0 (Broad Institute, UCSD) for gene set enrichment analysis. GraphPad Prism
ver 7.0 (GraphPad) for biostatistical analysis and scientific graphing. Excel ver 16.14.1 (Microsoft) for spreadsheet management.
SnapGene ver 4.2.1 (GSL Biotech LLC) for vector visualization and cloning. ImageJ ver 1.51J (NIH) for densitometry image analysis.
Living Image ver 4.5 (Perkin Elmer) for IVIS Spectrum bioluminescent imaging analysis. Incucyte ZOOM ver 2016B (Essen BioScience) for
Incucyte image analysis and quantification. DESeq2 (version 2.1.24) and ggplot2 (version 3.2.1) packages in R (ver 3.5.0) for PCA and plots
of bulk RNA-seq data. For single cell RNA-seq analysis: CD19-28z vs. GD2-28z: Cell Ranger (version 1.2.0), R Studio (version 1.1.453), R
(version 3.5.0), and packages: cellrangerRkit (version 2.0.0), dplyr (version 0.7.6), DESeq2 (version 1.20.0), GO.db (version 3.6.0),
org.Hs.eg.db (version 3.6.0), gplots (version 3.0.1), pheatmap (version 1.0.10), igraph (version 1.2.1) and ggplot2 (version 3.0.0); Her2-BBz
vs. JUN-Her2-BBz: Cell Ranger (version 3.0.2), R Studio (version 1.1.456), R (version 3.5.2), future (version 1.14.0), sctransform (version
0.2.0), gplots (version 3.0.1.1), Seurat (version 3.0.3.9035), dplyr (version 0.8.3), reshape2 (version 1.4.3), ggplot2 (version 3.2.1). For
ATAC-seq analysis: SeqPurge, Bowtie2 (Johns Hopkins, ver 2.1.0), Picard tools (Broad Institute, ver 1.79), and the following packages in R:
MACS2 (ver 2.1.0.20150731), Bedtools, motifmatchr, rtracklayer, DESeq2 (ver 1.20.0), and chromVAR. ENCODE hg19 blacklist (UCSC
genome browser), HOMER Motif Analysis ver 4.9 (UCSD). chromVAR ver 1.60.


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


The sequencing datasets generated in this publication have been deposited in NCBI's Gene Expression Omnibus and are accessible through GEO Series accession
numbers: bulk RNA-seq: GSE136891, scRNA-seq CD19/GD2-28z: GSE136874, scRNA-seq Control/JUN-Her2-BBz TILs: GSE136805, ATAC-seq: GSE136796, ChIP-seq:
GSE136853. Figures associated with these datasets are Figures 1, 2 and 6 and Extended Data Figures 1, 2, 3, 4, 6, 7 and 10.


Field-specific reporting 


Please select the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


[X] Life sciences [ ] Behavioural & social sciences [ ] Ecological, evolutionary & environmental sciences


For a reference copy of the document with all sections, see nature.com/authors/policies/ReportingSummary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Our established animal models consistently yield low variance, and difference in the means between groups greater than 50%. Power 
calculation estimates determined that 5 animals per group was sufficient to determine statistical significance. 


Data exclusions No data were excluded from analysis.


Replication In vivo experiments were replicated at least 2 times with independent experiments (as stated in the figure legends) with similar results. All
replication results provided the same trends and conclusions. As noted, long-term tumor-free survival in the Nalm6-GD2 model (Figure 5) was
affected by outgrowth of GD2(-) clones in some experiments.

In vitro experiments were typically run with triplicate technical replicates and were reproduced at least 3 times in independent experiments
(as stated in the figure legends) with similar results.


Randomization For in vivo tumor models, mice were randomized to ensure equal mean tumor burden before T cell transfer. 


Blinding Investigators were blinded during in vivo tumor measurement. Otherwise, fully informed data analysis was performed. Fully blinded
experiments were not possible because personnel were not available to accommodate them.


Reporting for specific materials, systems and methods 
Materials & experimental systems


[ ] | [X] Antibodies
[ ] | [X] Eukaryotic cell lines
[ ] | [X] Flow cytometry
[X] | [ ] MRI-based neuroimaging
[ ] | [X] Animals and other organisms
[ ] | [X] Human research participants


Unique biological materials 
Policy information about availability of materials 


Obtaining unique materials Reagents (vector constructs) developed in house during this study are available upon request from the corresponding author. 


Antibodies 
Antibodies used

From BioLegend: CD4-APC-Cy7 (clone OKT4), CD8-PerCp-Cy5.5 (clone SK1), TIM3-BV510 (clone F38-2E2), CD39-FITC or APC-Cy7
(clone A1), CD95-PE (clone DX2), CD3-PacBlue (clone HIT3a), CD107a-BV607 (clone H4A3), IL2-PECy7 (clone MQ1-17H12),
IFNg-APC/Cy7 (clone 4S.B3), and TNFa-BV711 (clone Mab11).

From eBioscience: PD1-PE-Cy7 (clone eBio J105), LAG3-PE (clone 3DS223H), CD45RO-PE-Cy7 (clone UCHL1), CD45-PerCp-Cy5.5
(clone HI30).

From BD: CD45RA-FITC or BV711 (clone HI100), CCR7-BV421 (clone 150503), CD122-BV510 (clone Mik-β3), CD62L-BV605 (clone
DREG-56), CD4-BUV395 (clone SK3), CD8-BUV805 (clone SK1).

1A7 obtained from NCI-Frederick, CD19-idiotype provided by Laurence Cooper.

For western blot: Cell Signaling: GAPDH (D4C6R), Histone-3 (1B1B2), c-Jun (60A8), P-c-JunSer73 (D47G9), JunB (C37F9),
BATF (D7C5) and IRF4 (4964). The BATF3 (AF7437) antibody was from R&D. The b-Actin (clone AC-15)-HRP conjugated antibody was from
Sigma-Aldrich.


Validation

All antibodies used for flow cytometry were validated by the manufacturer directly in human peripheral blood mononuclear
cells. In our laboratory, antibody-specific staining was compared to isotype and no staining control samples.

CD4-APC-Cy7 (clone OKT4) was validated here https://www.biolegend.com/en-us/products/apc-cyanine7-anti-human-cd4-antibody-3658

CD8-PerCp-Cy5.5 (clone SK1) was validated here https://www.biolegend.com/en-us/products/percp-cyanine5-5-anti-human-cd8-antibody-6389

TIM3-BV510 (clone F38-2E2) was validated here https://www.biolegend.com/en-us/search-results/brilliant-violet-510-anti-human-cd366-tim-3-antibody-12009

CD39-FITC or APC-Cy7 (clone A1) was validated here https://www.biolegend.com/en-us/products/fitc-anti-human-cd39-antibody-4363

CD95-PE (clone DX2) was validated here https://www.biolegend.com/en-us/products/pe-anti-human-cd95-fas-antibody-643

CD3-PacBlue (clone HIT3a) was validated here https://www.biolegend.com/ja-jp/search-results/pacific-blue-anti-human-cd3-antibody-6505

CD107a-BV607 (clone H4A3) was validated here https://www.biolegend.com/en-us/products/pe-anti-human-cd107a-lamp-1-antibody-4967

IL2-PECy7 (clone MQ1-17H12) was validated here https://www.biolegend.com/en-gb/products/pe-cy7-anti-human-il-2-antibody-6513

IFNg-APC/Cy7 (clone 4S.B3) was validated here https://www.biolegend.com/en-us/products/apc-anti-human-ifn-gamma-antibody-1012

TNFa-BV711 (clone Mab11) was validated here https://www.biolegend.com/en-us/products/brilliant-violet-711-anti-human-tnf-alpha-antibody-9034

PD1-PE-Cy7 (clone eBio J105) was validated here https://www.thermofisher.com/antibody/product/CD279-PD-1-Antibody-clone-eBioJ105-J105-Monoclonal/25-2799-42

LAG3-PE (clone 3DS223H) was validated here https://www.thermofisher.com/antibody/product/CD223-LAG-3-Antibody-clone-3DS223H-Monoclonal/12-2239-42

CD45RO-PE-Cy7 (clone UCHL1) was validated here https://www.thermofisher.com/antibody/product/CD45RO-Antibody-clone-UCHL1-Monoclonal/25-0457-42

CD45-PerCp-Cy5.5 (clone HI30) was validated here http://www.bdbiosciences.com/us/applications/research/stem-cell-research/cancer-research/human/percp-cy55-mouse-anti-human-cd45-hi30/p/564105

CD45RA-FITC or BV711 (clone HI100) was validated here http://www.bdbiosciences.com/eu/applications/research/t-cell-immunology/regulatory-t-cells/surface-markers/human/bv711-mouse-anti-human-cd45ra-hi100/p/563733

CCR7-BV421 (clone 150503) was validated here https://www.bdbiosciences.com/us/applications/research/t-cell-immunology/th-2-cells/surface-markers/human/bv421-mouse-anti-human-cd197-ccr7-150503/p/562555

CD122-BV510 (clone Mik-β3) was validated here http://www.bdbiosciences.com/us/applications/research/t-cell-immunology/regulatory-t-cells/surface-markers/human/bv510-mouse-anti-human-cd122-mik-3/p/563093

CD62L-BV605 (clone DREG-56) was validated here http://www.bdbiosciences.com/us/applications/research/t-cell-immunology/regulatory-t-cells/surface-markers/human/bv605-mouse-anti-human-cd62l-dreg-56/p/562720

CD4-BUV395 (clone SK3) was validated here http://www.bdbiosciences.com/us/applications/research/t-cell-immunology/th-1-cells/surface-markers/human/buv395-mouse-anti-human-cd4-sk3-also-known-as-leu3a/p/563550
CD8-BUV805 (clone SK1) was validated here http://www.bdbiosciences.com/eu/reagents/research/antibodies-buffers/immunology-reagents/anti-human-antibodies/cell-surface-antigens/buv805-mouse-anti-human-cd8-sk1/p/612890. This product
is now discontinued.

1A7 anti-idiotype to detect GD2-CAR was first validated in Sen et al, J Immunother 1998, and further utilized and validated in
multiple publications (Long AH et al, Nat Med 2015; Alfonso M, J Immunol 2002; Mount CW, Nat Med 2018).

CD19-idiotype provided by Laurence Cooper was validated here https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3585808/ and
further utilized in multiple publications (Lee DW, The Lancet 2015; Long AH, Nat Med 2015; Kochenderfer JN, J Clin Oncol 2015;
Fraietta JA, Nat Med 2018).

c-Jun (60A8), IRF4 (4964), JunB (C37F9) and BATF (D7C5) were validated by the manufacturer (https://www.cellsignal.com) as well as by
our lab using CRISPR-Cas9-edited human primary T cells followed by western blotting and/or flow cytometry.

BATF3 (AF7437) was validated by our lab using CRISPR-Cas9-edited human primary T cells followed by western blotting, as well
as here https://www.rndsystems.com/products/human-batf3-antibody_af7437.

GAPDH (D4C6R) was validated here https://www.cellsignal.com/products/primary-antibodies/gapdh-d4c6r-mouse-mab/97166?N=4294956287&Ntt=gapdh&fromPage=plp

b-Actin-HRP (clone AC-15) was validated here https://www.sigmaaldrich.com/catalog/product/sigma/a3854?lang=en&region=US

P-c-JunSer73 (D47G9) was validated in our lab using CRISPR-Cas9-edited human primary T cells and overexpression vectors
followed by western blotting. Further validation here https://www.cellsignal.com/products/primary-antibodies/phospho-c-jun-ser73-d47g9-xp-rabbit-mab/3270.

Histone 3 (1B1B2) was validated here https://www.cellsignal.com/products/primary-antibodies/histone-h3-1b1b2-mouse-mab/14269.


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) Nalm6-GL was originally provided by Steve Grupp (CHOP). EW8 and TC32 were provided by Lee Helman (NCI). The 143b line 
was provided by C. Khanna (NCI, NIH); Kelly from C. Thiele (NCI, NIH); 293GP line by the Surgery Branch (NCI). 293T cells were 
obtained from the American Type Culture Collection (ATCC, Manassas) 


Authentication STR DNA profiling of all cell lines is conducted by Genetica Cell Line testing once per year. 
Mycoplasma contamination Before using for in vivo experiments, cell lines are tested with MycoAlert detection kit (Lonza). All cell lines tested negative. 


Commonly misidentified lines None of the cell lines used in this study are included in the commonly misidentified cell lines registry. 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals 6-8 week old male or female NOD/SCID/IL2Rg (NSG) mice were used for all in vivo experiments. 
Wild animals The study did not involve wild animals. 
Field-collected samples The study did not involve samples collected from the field. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Buffy coats from anonymous healthy donors (male and female, ages 30-50) were purchased from the Stanford University Blood
Bank.


Recruitment Written informed consent was obtained from all healthy donors. 


ChIP-seq 


Data deposition 


Confirm that both raw and final processed data have been deposited in a public database such as GEO. 


Confirm that you have deposited or provided access to graph files (e.g. BED files) for the called peaks. 


Data access links GEO Accession number: GSE136853 
May remain private before publication. 


Files in database submission 


Genome browser session http://genome.ucsc.edu/s/snagara/Lynn%20et%20al%20enhanced%20CAR%2DT%20ChIP%2Dseq 
(e.g. UCSC) 
Methodology


Replicates ChIP-seq was performed in both JUN and Control (HA only) CAR-T cells in two biological replicates. Each replicate was a 
distinct viral transduction. 


Sequencing depth All ChIP-seq experiments were performed as paired-end sequencing at 2x75 bp length. 
Unique reads: 
ha-1-inp 89644144 
ha-1-irf4 68466420 
ha-1-jun 45203466 
ha-2-inp 61388074 
ha-2-irf4 39513462 
jun-1-inp 61213392 
jun-1-irf4 55079410 
jun-1-jun 46053188 
jun-2-inp 54899776 
jun-2-irf4 50793788 
jun-2-jun 42178818 
Antibodies JUN antibody: Active Motif, cat. no. 39309
IRF4 antibody: Abcam, cat. no. ab101168, lot. no. GR139704-21


Peak calling parameters Reads were mapped using bowtie2 version 2.2.4 with the --very-sensitive flag. Peaks were called using MACS2 version 
2.1.1.20160309 callpeak. 

HA-1 IRF4 and JUN peaks were called over ha-1-inp.sort.rmdup.bam 

HA-2 IRF4 peaks were called over ha-2-inp.sort.rmdup.bam

JUN-1 IRF4 and JUN peaks were called over jun-1-inp.sort.rmdup.bam 

JUN-2 IRF4 and JUN peaks were called over jun-2-inp.sort.rmdup.bam 


Data quality ChIP peak enrichment was calculated using ENCODE QC parameters. Briefly, peak enrichment quality control was performed 
by downsampling bam files to 20 million reads using picardtools DownsampleSam and calling peaks again. ChIP and input 
coverage at these peaks was calculated using HOMER annotatePeaks.pl in 1 bp bins for 5 kb flanking each peak center. 
Enrichment at each base pair was calculated as (ChIP coverage/input coverage) and overall peak enrichment was defined as 
the maximum enrichment in the 500 bp flanking the peak center. 
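
The enrichment definition above can be sketched in Python as follows. This is an illustrative reading only: it assumes that per-base coverage has already been extracted into two equal-length lists, that "500 bp flanking the peak center" means 500 bp on each side, and that positions with zero input coverage are skipped. None of these choices is confirmed by the text, and the function name is hypothetical.

```python
def peak_enrichment(chip_cov, input_cov, center, flank=500):
    """Return the maximum per-base ChIP/input ratio within `flank` bp of `center`.

    chip_cov and input_cov are per-base coverage values over the same region.
    Positions with zero input coverage are skipped (an assumption of this sketch).
    """
    if len(chip_cov) != len(input_cov):
        raise ValueError("coverage vectors must have the same length")
    start = max(0, center - flank)
    end = min(len(chip_cov), center + flank + 1)
    ratios = [
        chip_cov[i] / input_cov[i]
        for i in range(start, end)
        if input_cov[i] > 0
    ]
    return max(ratios) if ratios else float("nan")

# Hypothetical toy region of 5 bp with the peak center at index 2.
chip = [2, 4, 9, 4, 2]
inp = [1, 1, 3, 2, 0]
print(peak_enrichment(chip, inp, center=2, flank=1))
```

Taking the maximum rather than the mean makes the score sensitive to a single base with low input coverage, which is one reason a pipeline might smooth coverage or add a pseudocount before dividing.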

Sample / Number of peaks called / Peak enrichment:
ha-1-irf4 / 52270 / 5.96
ha-1-jun / 1712 / 1.61
ha-2-irf4 / 55087 / 9.59
jun-1-irf4 / 43398 / 5.80
jun-1-jun / 7857 / 3.35
jun-2-irf4 / 43639 / 8.90
jun-2-jun / 9703 / 3.39


Software Sequencing adaptors were trimmed using cutadapt version 1.11 and aligned to the hg19 reference genome using bowtie2
version 2.2.4 with the --very-sensitive flag. Aligned reads with quality less than 10 were removed and the remaining reads were sorted
using samtools version 1.2. PCR duplicates were removed using picardtools version 1.128 MarkDuplicates.

For JUN ChIP-Rx, reads were first aligned to the hg19 genome and unmapped reads were mapped again to the dm6 
Drosophila genome. Drosophila spike-in reads were filtered and deduplicated using the same parameters as described 
above. Sample H3K27me3 ChIP-seq were then normalized to reference-adjusted reads per million (RRPM), with 
normalization factor Nd = 1,000,000/(number of unique Drosophila reads) used to make bigwig files for visualization. 
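
As a small worked illustration of the scaling described above, the following Python sketch computes the spike-in factor Nd = 1,000,000/(number of unique Drosophila reads) and applies it to a coverage track. The function names and read counts are hypothetical; in practice the factor would be passed to the tool that writes the bedGraph or bigwig file.

```python
def rrpm_factor(unique_spikein_reads):
    """Reference-adjusted reads per million: Nd = 1e6 / unique spike-in reads."""
    if unique_spikein_reads <= 0:
        raise ValueError("spike-in read count must be positive")
    return 1_000_000 / unique_spikein_reads

def scale_coverage(coverage, factor):
    """Multiply every raw coverage value by a normalization factor."""
    return [value * factor for value in coverage]

# Hypothetical example: 2 million unique Drosophila reads in one sample.
nd = rrpm_factor(2_000_000)
print(nd)
print(scale_coverage([10, 40, 20], nd))
```

Because the factor depends only on the spike-in read count, a sample with fewer Drosophila reads is scaled up more strongly, which is the intended effect of reference-based normalization.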

For track visualization, each bam file was normalized to one million reads (rpm) or RRPM for JUN ChIP using bedtools version 
2.19.1 genomecov -bg -split and then converting to bigwig format with bedGraphToBigWig. Tracks were displayed using the 
UCSC genome browser. Peaks were called using macs2 version 2.1.1.20160309 callpeak over input controls. 

Correlation plots were generated by counting reads within merged peaks using bedtools multicov. Enrichment plots were 
generated using HOMER by counting reads in 5 bp bins surrounding peak centers. 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots.


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 


Sample preparation Up to 1 million T cells from culture were washed with PBS + 2% FBS (FACS Buffer), labeled in 100 μL FACS Buffer containing the
relevant antibodies, and incubated at 4 °C in the dark for 20 minutes. Samples were washed twice in 1 mL FACS Buffer before running.


Instrument BD LSRFortessa X-20 
Software FACSDiva for collection and FlowJo for analysis. 


Cell population abundance 


Gating strategy All samples are gated on FSC/SSC lymphocyte populations, single cells (using FSC-W/FSC-H and SSC-W/SSC-H), then either CD4+
or CD8+. Frequency of positive gates is determined using an isotype or FMO control.


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
Developmental ROS individualizes
organismal stress resistance and lifespan

A central aspect of aging research concerns the question of when individuality in
lifespan arises. Here we show that a transient increase in reactive oxygen species
(ROS), which occurs naturally during early development in a subpopulation of
synchronized Caenorhabditis elegans, sets processes in motion that increase stress
resistance, improve redox homeostasis and ultimately prolong lifespan in those
animals. We find that these effects are linked to the global ROS-mediated decrease in
developmental histone H3K4me3 levels. Studies in HeLa cells confirmed that global
H3K4me3 levels are ROS-sensitive and that depletion of H3K4me3 levels increases
stress resistance in mammalian cell cultures. In vitro studies identified SET1/MLL
histone methyltransferases as redox sensitive units of the H3K4-trimethylating
complex of proteins (COMPASS). Our findings implicate a link between early-life
events, ROS-sensitive epigenetic marks, stress resistance and lifespan.


Genetic effects are estimated to account for only 10-25% of the observed
differences in human lifespan. However, the remaining differences are
not entirely attributable to environmental factors. Even when isogenic
animals, such as Caenorhabditis elegans, are cultivated under identical
environmental conditions, individual lifespans can vary by more
than 50-fold. These results suggest that other, more stochastic factors
account for variations in lifespan. Previous studies in C. elegans revealed
that as early as day 1 of adulthood, subpopulations of longer-lived
animals emerge. We therefore focused on the concept that specific
fluctuating signals during development might differentially affect
processes that determine lifespan. We investigated the idea that ROS
might serve as early lifespan-determining modulators in C. elegans. This
idea was built on C. elegans studies, which showed (1) that significant
lifespan extension occurs following exposure to pharmacologically
generated ROS in young adults or by altering mitochondrial activity
during development, (2) that exposure of nematodes to non-lethal
concentrations of ROS leads to increased stress resistance and longevity,
a phenomenon termed mitohormesis and (3) that individual
larvae of a synchronized wild-type population exhibit large variations
in endogenous ROS levels.


Early-life ROS affect adult redox states 


To investigate whether and how developmental ROS levels affect C.
elegans later in life, we used wild-type N2 worms that ubiquitously
express the integrated redox-sensing protein Grx1-roGFP2, which faithfully
responds to the cellular ratio of oxidized and reduced glutathione
(GSSG:GSH). Consistent with previous peroxide measurements, L2
larvae revealed a significantly more oxidizing redox environment and
substantially larger individual differences than young adults, which
exhibited a maximally reduced environment with smaller inter-individual
differences (Extended Data Fig. 1a). With increasing age, the average
redox state became more oxidizing and individual differences in redox
status re-emerged. Subsequent analysis of about 16,000 age-synchronized
L2 larvae using a reconfigured large particle BioSorter (Extended
Data Fig. 1b) confirmed our microscopy studies and showed that the
GSSG:GSH ratio varies widely among individuals (Fig. 1a). We sorted
and binned L2 worms with redox states 2-3 standard deviations above
(L2^ox) or below (L2^red) the mean population (L2^mean) (Fig. 1a, Extended
Data Fig. 1c), and confirmed their different redox states by fluorescence
microscopy (Fig. 1b, Extended Data Fig. 2a-d). Notably, L2^ox and L2^red
worms did not differ significantly in size, reproductive activity,
mitochondrial respiratory chain function or glycolytic flux (Extended Data
Fig. 3a-f), excluding the possibility that more extreme early-life redox
states affect development or other relevant physiological parameters.
Subsequent redox analysis of sorted L2^ox and L2^red worms showed that
all animals become similarly reduced in young adulthood and become
more oxidized as they age (Fig. 1c). By day 7 of adulthood, however,
the L2^ox worms were significantly more reduced than the L2^red worms.
The trigger of the transient increase in GSSG:GSH ratios during early
development and the mechanisms that cause the observed switch in
endogenous redox states during adulthood are unknown. However,
our results demonstrate that a synchronized population of C. elegans
larvae contains subpopulations with redox environments that imprint
information that becomes relevant later in life.
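
The binning rule used for sorting (2-3 standard deviations above or below the population mean) can be expressed as a short Python sketch. It is an illustration rather than the sorter's actual gating logic: the function name is hypothetical, the population standard deviation is assumed, and the width of the "mean" window (here ±0.5 standard deviations) is an assumed parameter that the text does not specify.

```python
import statistics

def assign_redox_bins(ratios, mean_window_sd=0.5):
    """Label each Grx1-roGFP2 ratio by its z-score within the population.

    2 to 3 SD above the mean  -> "L2ox"  (more oxidized)
    2 to 3 SD below the mean  -> "L2red" (more reduced)
    within mean_window_sd SD  -> "L2mean" (window width is an assumption)
    anything else             -> "unsorted"
    """
    mu = statistics.fmean(ratios)
    sd = statistics.pstdev(ratios)
    if sd == 0:
        raise ValueError("all ratios are identical; z-scores are undefined")
    labels = []
    for ratio in ratios:
        z = (ratio - mu) / sd
        if 2.0 <= z <= 3.0:
            labels.append("L2ox")
        elif -3.0 <= z <= -2.0:
            labels.append("L2red")
        elif abs(z) <= mean_window_sd:
            labels.append("L2mean")
        else:
            labels.append("unsorted")
    return labels

# Hypothetical toy population: six worms at one ratio and one outlier.
print(assign_redox_bins([0.0] * 6 + [1.0]))
```

Excluding worms beyond 3 standard deviations, as the 2-3 standard deviation window implies, keeps extreme outliers out of both sorted subpopulations.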


Early-life ROS extend lifespan 


To investigate potential downstream effects of the observed varia- 
tions in developmental redox levels, we compared stress resistance 


1Department of Molecular, Cellular and Developmental Biology, University of Michigan, Ann Arbor, MI, USA. 2State Key Laboratory of Bioreactor Engineering, East China University of Science
and Technology, Shanghai, China. 3Shanghai Collaborative Innovation Center for Biomanufacturing (SCICB), Shanghai, China. 4Department of Pathology, Michigan Medicine, Ann Arbor, MI,
USA. 5State Key Laboratory of Molecular Biology, National Center for Protein Science Shanghai, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and
Cell Biology, Chinese Academy of Sciences, Shanghai, China. *e-mail: ujakob@umich.edu


Nature | Vol576 | 12 December 2019 | 301 




[Figure 1, panels a-c. a, Distribution of Grx1-roGFP2 ratio (BioSorter), with insets plotted against time of flight. b, Grx1-roGFP2 ratio of L2^ox, L2^mean and L2^red worms. c, Grx1-roGFP2 ratio at L2 and at days 1, 3, 5 and 7 of adulthood.]

Fig. 1 | Endogenous redox state in an age-synchronized population of C.
elegans larvae. a, Distribution of intracellular glutathione redox potential
(Grx1-roGFP2 ratio) of an L2-staged N2 jrIs2[Prpl-17::Grx1-roGFP2] population.
L2 worms with Grx1-roGFP2 ratios between 2 and 3 standard deviations above
(L2^ox; red line and inset, R2) or below (L2^red; blue line and inset, R4) the mean
were sorted and compared with worms with mean Grx1-roGFP2 ratios (L2^mean;
green line and inset, R3). n = 15,599 worms. b, Representative microscopy
analysis of the Grx1-roGFP2 ratio of individual worms (dots) of the L2^ox, L2^mean


and lifespan of the sorted worms. We found that compared with L2™4 
worms, L2™ worms were significantly more resistant to heat shock 
(Extended Data Fig. 3g), about 30% longer lived after the initial heat- 
shock treatment (Fig. 2a) and substantially longer lived when grownin 
the presence of oxidants such as paraquat (Fig. 2b) or juglone (Extended 
Data Table 1). Moreover, L2™ worms displayed an increase of up to 18% 
in median lifespan and a 1—4-day increase in maximal lifespan (Fig. 2c, 
Extended Data Fig. 2e-i, Extended Data Table 2). This clear correlation 
between increased GSSG:GSH ratios, stress resistance and lifespan 
suggested that a subpopulation of synchronized worms undergo natu- 
rally occurring hormesis during early development. Indeed, when we 
generated a more oxidizing environment in an entire population of 
worms by exposing them to 1 mM paraquat for 10 h during their L2 
stage, the entire population became longer-lived (Fig. 2d with inset). 
Treatment of L2 worms with 10 mM N-acetylcysteine (NAC) for 10 h did
not substantially alter the redox state of the population and had no
significant lifespan effect. By conducting the same experiments with
previously sorted subpopulations, however, we found that a 10-h NAC
treatment decreased the lifespan of L2ox worms but not of the already
reduced L2red worms, whereas a 10-h paraquat exposure increased
the lifespan of L2red worms but not of the already oxidized L2ox worms
(Fig. 2e, f, Extended Data Table 3). These results provide evidence that
transient changes in the redox environment during early development
are sufficient to positively affect the lifespan of C. elegans.


Early-life ROS reduce H3K4me3 levels 


To gain insights into the mechanisms by which a transient increase in 
the cellular redox state during development might cause an increase in
stress resistance and lifespan, we first conducted quantitative PCR with 
reverse transcription (RT-PCR) to examine changes in mRNA levels of 
commonly assessed heat-shock- (Fig. 3a) and oxidative stress-related 
genes (Extended Data Fig. 4a). Unexpectedly, L2ox and L2red worms
did not significantly differ in the steady-state expression levels of any
of these genes. However, on exposure to heat-shock conditions, L2ox
worms showed a significantly increased capacity to upregulate heat-shock
gene expression compared with L2red worms (Fig. 3a). Changes
in transcript levels of heat-shock factor HSF-1 were not significantly
different, suggesting that the transcriptional stimulation in L2ox worms
is a result of either specific changes in HSF-1 activity, its subcellular
localization or the accessibility of heat-shock promoters.
Subsequent RNA-sequencing analysis from four independent 
large-scale sorting experiments identified 191 upregulated and 136 
and L2red subpopulations. n, number of worms; one-way ANOVA with Tukey
correction. The experiment was repeated four more times with similar results
(see Extended Data Fig. 2a-d). c, Longitudinal analysis of the redox state.
Sorted L2ox and L2red worms were cultivated at 20 °C and the Grx1-roGFP2 ratio
of worms in each subpopulation was determined microscopically at the
indicated time points. n, number of worms; two-sided Mann-Whitney U-test. In
b, c, bars represent mean and error bars show s.e.m.


downregulated genes in L2ox worms compared with L2red worms
(Extended Data Fig. 4b). We were unable to draw any clear link between
the differentially expressed genes (DEGs) and previously identified sets
of stress- or longevity-related genes (Extended Data Fig. 4c, Supplementary
Table 1). However, 26 of the 191 upregulated genes in L2ox worms
overlapped with a set of 101 genes previously shown to be upregulated
in worms lacking the absent small homeodisc protein ASH-2 (expected
overlap if no correlation <1; P=1.7 x10”, hypergeometric probability)
(Fig. 3b, Supplementary Table 1). ASH-2 is a component of the highly
conserved histone methylation complex COMPASS, which, together
with a member of the SET1/MLL histone methyltransferase family (SET-2
in C. elegans) and other partner proteins, causes trimethylation of
lysine 4 in histone H3 (H3K4me3). H3K4me3 is primarily found at
transcription start sites, where the modification is thought to mark and
maintain transcriptionally active genes. Recent studies in C. elegans
revealed that H3K4me3 marks associated with transcription start sites
are set during early development and remain stable throughout life.
Indeed, analysis of published chromatin immunoprecipitation data
revealed that about 25% of the DEGs that we identified in L2ox worms
associate with H3K4me3 marks that appear to be set during development
(Extended Data Fig. 4d, e). Furthermore, we found a highly significant
overlap between DEGs in L2ox worms and DEGs in strains lacking the
H3K4me3 readers SET-9 or SET-26, similar to the overlap between
ash-2 knockdown and set-9 or set-26 deletion strains (Extended Data
Fig. 4f, g). On the basis of these results, we decided to analyse the global
H3K4me3 abundance in L2ox and L2red worms by western blotting using
antibodies against H3K4me3. We tested the subpopulations of seven
independent sorting experiments and found a clear and highly reproducible
reduction of more than 25% in global H3K4me3 levels in L2ox
worms (Fig. 3c, Extended Data Fig. 5a). By contrast, other marks, such
as H3K27ac or H3K27me3, were not significantly different between the
two subpopulations (Extended Data Fig. 5b, c). These results strongly
suggest a link between endogenous ROS levels, H3K4 trimethylation
levels and gene regulation.
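
The overlap statistic quoted above (26 genes shared between sets of 191 and 101, with an expected overlap below 1) corresponds to a hypergeometric tail probability. The following is a minimal sketch of that calculation, not the authors' own script. The background of 20,000 genes is an assumption, since the text does not state the gene universe that was used, and the function names are hypothetical.

```python
from math import comb


def expected_overlap(n_background, n_set_a, n_set_b):
    """Expected number of shared genes if the two sets are independent."""
    return n_set_a * n_set_b / n_background


def hypergeom_tail(n_background, n_set_a, n_set_b, observed):
    """P(overlap >= observed) when n_set_b genes are drawn at random from
    a background that contains n_set_a marked genes."""
    upper = min(n_set_a, n_set_b)
    favourable = sum(
        comb(n_set_a, k) * comb(n_background - n_set_a, n_set_b - k)
        for k in range(observed, upper + 1)
    )
    # Exact integer arithmetic; only the final division produces a float.
    return favourable / comb(n_background, n_set_b)


N_BACKGROUND = 20_000  # ASSUMPTION: gene universe size, not given in the text

expected = expected_overlap(N_BACKGROUND, 101, 191)
p_value = hypergeom_tail(N_BACKGROUND, 101, 191, 26)
print(f"expected overlap = {expected:.2f}, P(overlap >= 26) = {p_value:.1e}")
```

The resulting probability depends on the assumed background size, so it should not be expected to reproduce the published P value exactly.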


A redox-sensitive histone mark

RNA-sequencing analysis of L2ox and L2red worms did not reveal any
transcriptional changes in components of the COMPASS complex. 
This result suggested that the activity rather than the level of the 
H3K4me3 complex is affected by the redox environment. To directly 
test whether the H3K4me3 machinery is ROS-sensitive, we attempted to 
purify the C. elegans proteins for in vitro methylation assays. However, 
Fig. 2 | Oxidized L2 subpopulations show increased stress resistance
and longer lifespan. Experiments were performed with N2jrIs2[Prpl-17::Grx1-roGFP2]
worms sorted into L2ox, L2mean and L2red subpopulations.
a, Representative survival curves of sorted worms that survived heat-shock
treatment. b, Representative survival curves of sorted worms cultivated on
NGM plates supplemented with 2 mM paraquat. c, Representative survival
curves of sorted worms. See Extended Data Fig. 2e-i for repetitions. Inset,
Grx1-roGFP2 ratio of individual worms (dots) after sorting. n, number of
worms; two-sided unpaired t-test. d, Representative survival curves of a non-


this turned out to be impossible owing to their instability. By contrast, 
the mammalian members of the COMPASS have been previously used 
in H3K4 methylation assays in vitro. To determine whether H3K4
trimethylation is a redox-sensitive process in mammalian cells, we
subjected HeLa cells to non-lethal H2O2 treatment and investigated
global H3K4me3 levels. Consistent with our findings in C. elegans, we 
observed a global decrease of about 30% in H3K4me3 levels within 
30 min of peroxide treatment (Fig. 3d, Extended Data Fig. 5d), without 
detectable changes in steady-state levels of two of the main COMPASS 
components (Extended Data Fig. 5e, f). We subsequently purified the 
mammalian core COMPASS proteins, that is, the SET domain of MLL1,
which catalyses lysine-directed histone methylation, ASH2L, WDR5
(human homologue of C. elegans WDR-5.1) and RBBP5. We then treated
the individual proteins with peroxide for 30 min, removed the oxidant, 
recombined the proteins and tested them for in vitro histone methyla- 
tion activity. Notably, among these proteins, only the SET-domain of 
MLL1 appeared to be reproducibly peroxide-sensitive (Fig. 3e, Extended 
Data Fig. 5g, h). Incubation of the peroxide-inactivated SET domain 
with thiol-reducing dithiothreitol (DTT) restored the original activity, 
strongly suggesting that thiol oxidation is responsible for the revers- 
ible inactivation (Fig. 3e, Extended Data Fig. 5g, h). Analysis of the SET 
domain of other SET1/MLL-family members, including SET1A, the
most closely related human homologue of C. elegans SET-2 (Extended
Data Fig. 5i), SET1B (Extended Data Fig. 5j) and a version of MLL1 lacking
the cysteine-containing GST-tag used for purification (Extended
Data Fig. 5h) demonstrated that sensitivity towards peroxide is a uni- 
versal feature for SET1/MLL family members. To investigate which of 
the seven cysteines in the SET domain of MLL1 might be sensitive to 
sorted (mixed) worm population either untreated or treated at the L2 stage
with 1 mM paraquat or 10 mM NAC for 10 h. Inset, Grx1-roGFP2 ratio of
individual worms (dots) after treatment. n, number of worms; one-way ANOVA
with Tukey correction. Data in insets are mean ± s.e.m. e, f, Representative
survival curves of an L2red (e) or L2ox (f) subpopulation, either untreated or after
a 10-h treatment with 1 mM paraquat or 10 mM NAC. The specific sorting
events, number of individuals, repetitions and statistical analysis (log-rank) for
each of the datasets shown in this figure are presented in Extended Data
Tables 1-3.


reversible thiol oxidation, we conducted direct (Extended Data Fig. 5k,
l) and reverse thiol trapping (Fig. 3f) on oxidized and reduced MLL1 SET
domains using the 500-Da thiol-reactive compound
4-acetamido-4’-maleimidylstilbene-2,2’-disulfonic acid (AMS), followed by SDS-PAGE.
Analysis of the migration behaviour of thiol-trapped oxidized versus
reduced MLL1 SET domain suggested that peroxide treatment leads to
the formation of two intramolecular disulfide bonds. Subsequent mass
spectrometry analysis of in vitro-modified cysteines confirmed these
results and revealed that at least four of the five absolutely conserved
cysteines in the SET domain are highly sensitive to oxidation (Extended
Data Fig. 5m-o). To our knowledge, H3K4me3 is the only histone methylation
mark and MLL1 is the only histone methyltransferase that is
known to be posttranslationally redox-regulated.


H3K4me3 modulates stress resistance 


Our studies raised the possibility that redox-mediated inactivation 
of the lone SET1/MLL-homologue in the oxidized subpopulation of C. 
elegans (that is, SET-2) leads to a reduction in global H3K4me3 levels, 
which causes increased stress resistance and longevity. This theory was 
supported by recent studies showing that deleting or knocking down 
components of COMPASS in C. elegans increases lifespan. Indeed,
strains with either deleted (that is, set-2 or wdr-5.1) or depleted (that 
is, ash-2) members of COMPASS were significantly more heat stress 
resistant than respective control strains or a strain deficient in the 
H3K4me3-demethylase RBR-2 (Fig. 3g). In addition, and comparable 
to results obtained with L2ox worms, we found that worms deficient in
H3K4me3 (Extended Data Fig. 6a-c) showed a substantially increased
Fig. 3 | H3K4me3 is a redox-sensitive histone modification involved in stress
gene expression and resistance. a, Transcripts of heat-shock-associated genes
in sorted subpopulations before and after heat shock. n = 3 independent
sorting experiments; two-way ANOVA with Tukey correction. b, Venn diagram
of upregulated genes in L2ox and ash-2 RNAi worms (complete list in
Supplementary Table 1). c, Quantification of global H3K4me3 levels in L2ox and
L2red worms. n = 7 independent sorting experiments; unpaired two-sided t-test.
d, Quantification of H3K4me3 levels in HeLa cells before and after H2O2
treatment. n = 3 independent experiments; one-way ANOVA with Dunnett
correction. e, In vitro histone methyltransferase assays with the purified core
COMPASS proteins MLL1 (M), WDR5 (W), ASH2L (A) and RBBP5 (R); OX in
superscript indicates that the protein is pre-treated with 1 mM (+) or 2 mM (++)
H2O2 for 30 min before the activity assay. DTT was added after the H2O2
treatment as indicated. n = 3 independent experiments; one-way ANOVA with


transcriptional response after heat shock (Extended Data Fig. 6d, e). 
We obtained very similar results with HeLa cells depleted of ASH2L 
by short interfering RNA (siRNA) treatment (Extended Data Fig. 6f), 
which were more resistant to heat stress (Fig. 3h) and demonstrated 
an augmented transcriptional response to heat stress compared with 
control siRNA-treated cells (Fig. 3i). These results strongly suggested 
that the downstream effects of H3K4me3 depletion on stress resistance
and longevity are conserved. 

Finally, to test whether downregulation of H3K4me3 levels is sufficient
to increase heat-shock resistance and lifespan in L2ox worms, we
generated ash-2 or set-2 RNA interference (RNAi)-mediated knockdown
worms expressing the Grx1-roGFP2 redox sensor protein and sorted the
synchronized L2 population as before. We detected no significant difference
in the relative distribution or range of GSSG:GSH ratios between
knockdown and control RNAi worms (Extended Data Fig. 6g). However,
the sorted L2ox and L2red subpopulations of ash-2 and set-2 RNAi worms
no longer exhibited any difference in heat-shock sensitivity (Fig. 4a, b)
or lifespan (Fig. 4c, d, Extended Data Tables 2 and 4). Similarly, there was
Sidak correction. f, Reverse thiol trapping of oxidized and reduced MLL1 SET
domain. A 500-Da mass increase per oxidized thiol can be detected on
non-reducing SDS-PAGE. M, protein marker. g, Heat-shock survival of wild-type,
wdr-5.1, set-2, rbr-2 and set-2;rbr-2 worms after 48 h (n = 3 independent
experiments; one-way ANOVA with Dunnett correction) or
N2jrIs2[Prpl-17::Grx1-roGFP2] worms treated with ash-2 or control RNAi after 24 h (n = 5
independent experiments; two-sided unpaired t-test). h, Heat-shock survival of
HeLa cells treated with ASH2L siRNA. n = 5 independent experiments; two-way
ANOVA with Tukey correction. i, Transcript levels of heat-shock genes after
30-min heat stress treatment of HeLa cells treated with ASH2L siRNA. NT,
non-targeting. n = 3 independent experiments; two-sided unpaired t-test. Data in a,
c-e and g-i are mean ± s.e.m. For blot and gel source images, see
Supplementary Figs. 1-3.


no life-prolonging effect when we treated ash-2 or set-2 RNAi worms with
paraquat for 10 h at the L2 larval stage (Extended Data Fig. 6h, Extended
Data Table 4). These results imply that downregulation of H3K4me3 
levels is both necessary and sufficient to increase heat-shock resistance 
and lifespan in the oxidized subpopulation of worms. 


Conclusions 


In 2010, Cynthia Kenyon suggested that a stochastic event might flip 
an epigenetic switch or set in motion a chain of events that promotes 
ageing. Our studies have revealed that variations in endogenous ROS
during development—potentially caused by locally different growth 
conditions—contribute to the variation in lifespan observed in syn- 
chronized populations of C. elegans. Animals that accumulate high 
levels of ROS during development apparently undergo an endogenous 
hormesis event, which, as previously observed with exogenous ROS 
treatment, increases stress resistance and lifespan. This might serve
as a bet-hedging strategy to provide subpopulations of worms with 
Fig. 4 | An intrinsically oxidizing environment confers increased stress
resistance via downregulation of global H3K4me3 levels. a, b, Survival of
N2jrIs2[Prpl-17::Grx1-roGFP2] worms treated with ash-2 (a) or set-2 (b) RNAi,
sorted into L2ox and L2red and counted 24 h after heat shock. n = 3 (a) and n = 4 (b)
independent experiments; two-way ANOVA with Tukey correction. Data are
mean ± s.e.m. c, d, Representative survival curves of
N2jrIs2[Prpl-17::Grx1-roGFP2] worms treated with ash-2 (c) or set-2 (d) RNAi and sorted into L2ox and
L2red. For n numbers, repetitions and statistics (log-rank) in c and d, see
Extended Data Tables 2 and 4.


improved survival during stress. Our studies reveal the underlying 
mechanism of ROS-mediated hormesis by demonstrating that global 
H3K4me3 levels are redox-sensitive and decrease in response to oxida- 
tive stress. On the basis of the findings that decreased global H3K4me3 
levels increase stress resistance and C. elegans lifespan, we postulate
that we have identified a stochastic event, the epigenetic switch and 
the chain of events that are set in motion during early development 
to increase lifespan. 

Recent studies in C. elegans demonstrated that H3K4me3 marks
within gene bodies are set during adulthood and change with age.
By contrast, H3K4me3 enrichment at transcription start sites, which
is thought to provide a ‘memory’ of actively transcribed genes, is set
during development and remains stable throughout the lifespan. This
result probably explains how transient redox-mediated changes in
H3K4me3 levels during development are sufficient to exert long-lasting
effects despite the marked changes in the redox environment during 
adulthood. Our finding that organisms with lower levels of H3K4me3 
show an increased transcriptional response to stress conditions is 
initially counterintuitive, given that H3K4me3 is widely considered an 
activating mark. However, our results are fully consistent with yeast
studies, which have shown that reduction of H3K4me3 levels causes 
substantially more robust gene expression changes upon stress treatment.
The extent to which this increase in transcriptional capacity
of stress-related genes is linked to the observed lifespan extension, 
however, remains to be determined. A substantial number of genes 
involved in lipid metabolism were found to be downregulated in L2ox
worms (Extended Data Fig. 4c, Supplementary Table 1). This is notable,
as increased lipid storage and altered lipid signalling have previously
been linked to increased lifespans of a variety of different organisms


and found to have a role in the extension of lifespan of worms globally
deficient in H3K4me3 levels. Further investigation is needed to reveal
how the transient downregulation of H3K4me3 levels selectively during
development can elicit similarly profound lifespan-altering effects. Our
ability to change the lifespan of an entire population with a simple 10-h
exposure to ROS during development suggests that we have identified
a time window and a mechanism that helps to individualize lifespan in
animals. This study provides a foundation for future work in mammals,
in which very early and transient metabolic events in life seem to have
equally profound impacts on lifespan.
Methods


No statistical methods were used to predetermine sample size. The experiments
were not randomized. The investigators were not blinded to allocation
during experiments, unless otherwise noted, and outcome assessment.


C. elegans strains, maintenance and lifespan assays 

The following C. elegans strains were used in this study: PBO20:
N2jrIs2[Prpl-17::Grx1-roGFP2], N2: Wild-type Bristol isolate, RB1304:
wdr-5.1(ok1417), RB10235: set-2(ok952), ZR1: rbr-2(tm1231) and ABR9:
set-2(ok952);rbr-2(tm1231). Unless stated otherwise, worms were cultured
at 20 °C. Standard procedures were followed for C. elegans strain maintenance.
Synchronization was performed using alkaline hypochlorite
solution; eggs were allowed to hatch by overnight incubation in M9
medium during gentle shaking. Newly hatched, arrested L1 larvae were
transferred onto standard nematode growth medium (NGM) plates
seeded with live E. coli OP50. Lifespan studies were performed at 20 °C
in the presence of fluorodeoxyuridine. Survival was scored every 2 days,
and worms were censored if they crawled off the plate, hatched inside or
lost vulva integrity during reproduction. The first day of adulthood was
set as t = 0. Lifespan assays of unsorted worm populations were performed in
a lifespan machine as previously described. Survival plots were generated using
GraphPad Prism. Lifespan data were analysed for statistical significance 
with log-rank (Mantel-Cox) or Gehan-Breslow-Wilcoxon test. 
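
For readers unfamiliar with the log-rank (Mantel-Cox) comparison named above, the following is a minimal sketch of the two-group test on censored survival data. It is an illustration only: the analyses in this study were run in GraphPad Prism, the function name and the example lifespans are hypothetical, and the Gehan-Breslow-Wilcoxon variant is not covered.

```python
import math


def logrank_test(times_a, events_a, times_b, events_b):
    """Two-sample log-rank (Mantel-Cox) test.

    times_*  : day on which each worm died or was censored
    events_* : True if death was observed, False if the worm was censored
    Returns (chi_square, p_value) with 1 degree of freedom.
    """
    data = [(t, e, 0) for t, e in zip(times_a, events_a)]
    data += [(t, e, 1) for t, e in zip(times_b, events_b)]
    death_times = sorted({t for t, e, _ in data if e})

    observed_minus_expected = 0.0
    variance = 0.0
    for t in death_times:
        at_risk_a = sum(1 for x, _, g in data if x >= t and g == 0)
        at_risk_b = sum(1 for x, _, g in data if x >= t and g == 1)
        deaths_a = sum(1 for x, e, g in data if x == t and e and g == 0)
        deaths_b = sum(1 for x, e, g in data if x == t and e and g == 1)
        n = at_risk_a + at_risk_b
        d = deaths_a + deaths_b
        observed_minus_expected += deaths_a - d * at_risk_a / n
        if n > 1:  # the variance term is undefined for a single animal
            variance += d * (at_risk_a / n) * (at_risk_b / n) * (n - d) / (n - 1)

    chi_square = observed_minus_expected ** 2 / variance
    # Survival function of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi_square / 2))
    return chi_square, p_value


# Hypothetical lifespans in days of adulthood; False marks a censored worm.
short_lived = [10, 12, 12, 14, 16, 18]
long_lived = [14, 16, 18, 20, 22, 24]
all_dead = [True] * 6
chi_square, p_value = logrank_test(
    short_lived, all_dead, long_lived, [True, True, False, True, True, True]
)
print(f"chi-square = {chi_square:.2f}, P = {p_value:.3f}")
```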


Reconfiguration of BioSorter for ratiometric sorting 

Lasers of wavelength 405 and 488 nm were used to excite the Grx1-roGFP2
sensor protein. Since the protein possesses a single emission maximum
(~520 nm), the two lasers in the BioSorter (Union Biometrica) were realigned
to sequentially illuminate single L2-staged worms as they passed
through the flow cell, without emitting overlapping signals. This enabled
collection of signals from the 405- and 488-nm lasers separately, from two
photomultiplier tubes. As a result, data were displayed as two groups
of peaks (Extended Data Fig. 1b). Using the partial profiling feature (pp)
of the FlowPilot-Pro software, we mapped the peaks corresponding to
each laser that trace the fluorescent intensity and extinction signals. The
extinction signal from the 488-nm laser was used to initially gate worms
at the L2 larval stage (R1 gate, see Extended Data Fig. 1c). Oxidized, mean
and reduced L2 worms were sorted from R2, R3 and R4 gates, respectively,
based on the peak 405- and 488-nm fluorescent intensities (inset in Fig. 1a).
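
The gating logic can be illustrated with a short sketch that assigns worms to the oxidized, mean and reduced gates from their 405/488 ratio. This is a simplified stand-in for the FlowPilot-Pro gates, not the instrument configuration: the 2 to 3 standard deviation windows follow the Fig. 1 legend, whereas the width of the mean gate (here ±0.5 standard deviations) and all function names are assumptions made for illustration.

```python
from statistics import mean, pstdev


def gate_for_z(z, mean_halfwidth_sd=0.5):
    """Map a z-score of the 405/488 ratio to a sorting gate label.

    mean_halfwidth_sd is an ASSUMED width for the mean gate (R3); the
    source does not state how wide that gate was drawn.
    """
    if 2.0 <= z <= 3.0:
        return "L2ox"    # R2: 2 to 3 s.d. above the population mean
    if -3.0 <= z <= -2.0:
        return "L2red"   # R4: 2 to 3 s.d. below the population mean
    if abs(z) <= mean_halfwidth_sd:
        return "L2mean"  # R3: close to the population mean
    return None          # worm is not sorted


def assign_gates(ratios, mean_halfwidth_sd=0.5):
    """Assign each worm's ratio to a gate using population mean and s.d."""
    mu = mean(ratios)
    sd = pstdev(ratios)
    if sd == 0:
        raise ValueError("all ratios are identical; gates are undefined")
    return [gate_for_z((r - mu) / sd, mean_halfwidth_sd) for r in ratios]


# Hypothetical ratios: ten worms at the mean, one oxidized, one reduced.
example_ratios = [5.0] * 10 + [6.0, 4.0]
print(assign_gates(example_ratios))
```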


Microscopy 

Worms were mounted on objective slides using 4 µl thermoreversible
CyGEL (BioStatus; Fisher Scientific) and 2 µl of 50 mM levamisole for
immobilization. Fluorescence and DIC images were acquired with an 
upright microscope equipped with a Photometrics Coolsnap HQ2 
cooled CCD camera, a UPlan S-Apo 20x objective (NA 0.75) and an X-CITE
exacte light source equipped with a closed feedback loop. For Grx1-roGFP2
fluorescence, an external filter wheel was used with excitation
filters 420/40x, 500/20x, dual bandpass dichroic T515LPXR and a single
emission filter 535/30x. Image analysis was performed in Metamorph 
(Molecular Devices) using a custom script. In brief, an intensity threshold 
was chosen by the user. Pixels above this threshold constitute regions 
of interest. Regions with very high signal in any channel (for example, 
fluorescent particles) were identified by applying an over-saturation 
threshold and excluded from regions of interest. Mean ratiometric 
values (excitation 420 nm-emission 535 nm/excitation 500 nm-emis- 
sion 535 nm) of regions of interest were calculated after subtraction 
of background. Acquisition parameters were kept identical across all 
samples. For body-length measurements, worms were measured from 
the nose to the tail tip and analysis was performed with ImageJ. 
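
The ratiometric image analysis described above can be sketched as follows. This is not the custom Metamorph script: it assumes that a pixel belongs to a region of interest only when both channels exceed the user threshold, and that the mean is taken over per-pixel ratios. Both choices, and all names, are assumptions made for illustration.

```python
def mean_ratio(ch420, ch500, threshold, saturation, background):
    """Mean background-subtracted 420/500 excitation ratio over the ROI.

    ch420, ch500 : flat sequences of pixel intensities (same order, same
                   length) acquired with 420-nm and 500-nm excitation
    threshold    : user-chosen intensity threshold defining the ROI
    saturation   : pixels at or above this value in any channel are excluded
    background   : background intensity subtracted from both channels
    """
    if threshold < background:
        raise ValueError("threshold must not be below the background level")
    ratios = []
    for a, b in zip(ch420, ch500):
        if a <= threshold or b <= threshold:
            continue  # outside the region of interest
        if a >= saturation or b >= saturation:
            continue  # over-saturated, for example a fluorescent particle
        ratios.append((a - background) / (b - background))
    if not ratios:
        raise ValueError("no pixels left in the region of interest")
    return sum(ratios) / len(ratios)


# Hypothetical four-pixel image: two ROI pixels, one saturated, one dim.
pixels_420 = [110, 210, 4000, 50]
pixels_500 = [210, 410, 300, 60]
print(mean_ratio(pixels_420, pixels_500, threshold=100, saturation=4000, background=10))
```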


Brood size 


L4-staged worms were transferred onto NGM plates and incubated at
15 °C. The parental animals were transferred daily to individual NGM
plates until the end of the reproductive period. The progeny of each
animal was counted at the L2 or L3 stage.


Cellular respiration 

Real-time oxygen consumption rates and extracellular acidification rates
were measured with a Seahorse XFe96 Analyzer (Seahorse Bioscience)
as described. In brief, 100 L2-staged worms were sorted directly into
individual wells of 96-well Seahorse utility plates at a final volume of
200 µl of 10% M9. Acute effects of the pharmacological inhibitors carbonyl
cyanide-4-(trifluoromethoxy)phenylhydrazone (FCCP), an accelerator
of the electron transport chain (ETC), and sodium azide (NaN3), a
complex IV and V inhibitor, were evaluated by injecting them during
the run at final concentrations of 20 µM and 40 mM, respectively.


Heat-shock treatment 

Heat shock was performed on solid OP50-seeded NGM plates wrapped
in parafilm and submerged in a pre-heated water bath. For thermotolerance
assays, worms were heat-shocked for 45 min at 38 °C. Survival was
scored after 24 h and then until the death of the last worm by absence of
touch response or pharyngeal pumping. For transcriptional response
assays, worms were heat-shocked for 30 min at 35 °C. After 1 h recovery
at 20 °C, worms were collected and snap-frozen in liquid nitrogen. All 
heat-shock treatments were applied to worms at the L2 stage. 


Treatments with NAC and paraquat 

For survival assays, worms were cultivated on solid OP50-seeded NGM
plates, supplemented with the indicated concentrations of NAC or 
paraquat. Survival was determined by absence of touch response or 
pharyngeal pumping. For transient exposure to compounds, worms 
were transferred into M9 medium supplemented with OP50 and the
indicated concentrations of NAC or paraquat for 10 h. Worms were 
harvested, washed three times with M9 and transferred onto regular 
OP50-seeded NGM plates. 


RNA extraction and real-time quantitative PCR 

Worms at the L2 stage (3,000-5,000; whole population or after sorting)
were ground in Trizol reagent (Life Technologies) with sea sand and
a pestle. After filtering of the sand, samples were vigorously shaken 
with chloroform, allowed to stand for 3 min at room temperature, and 
then centrifuged at 16,000g at 4 °C. The aqueous phase was then col- 
lected and RNA was purified using QIAGEN RNeasy RNA extraction 
columns according to the manufacturers’ recommendations. HeLa cells 
were directly lysed in the culture dish by adding Trizol, as per manufac- 
turers’ recommendations. For RNA isolation, after addition of ethanol, 
the lysate was loaded onto QIAGEN RNeasy RNA extraction columns. 
cDNA synthesis was performed using PrimeScript 1st strand cDNA Synthesis
Kit (Takara) and real-time quantitative PCR was performed using
Radiant Green Lo-ROX qPCR Kit (Alkali Scientific), as per manufacturers’
recommendations, in an Eppendorf Mastercycler epgradient S realplex2
detection system. Relative expression was calculated from cycle threshold
values using the 2^-ΔΔCt method and the expression of genes of interest
was normalized to housekeeping genes (cdc-42, pmp-3, panactin)
and/or spiked-in luciferase (10 pg ml^-1 Trizol).
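
As a worked illustration of the 2^-ΔΔCt calculation named above, the sketch below normalizes a target gene to the mean Ct of several reference genes and then to a control sample. The function name and the Ct values are hypothetical. Averaging the reference Ct values is one common convention and is an assumption here, since the text does not say how the three housekeeping genes were combined.

```python
from statistics import mean


def relative_expression(ct_target, ct_refs, ct_target_control, ct_refs_control):
    """Fold change of a target gene by the 2^-ddCt method.

    ct_target, ct_target_control : Ct of the gene of interest in the
                                   sample and in the control condition
    ct_refs, ct_refs_control     : Ct values of the reference genes
                                   (for example cdc-42, pmp-3, panactin)
    """
    delta_sample = ct_target - mean(ct_refs)             # dCt, sample
    delta_control = ct_target_control - mean(ct_refs_control)  # dCt, control
    return 2 ** -(delta_sample - delta_control)          # 2^-ddCt


# Hypothetical Ct values: the target amplifies two cycles earlier in the
# sample than in the control, with unchanged reference genes.
fold = relative_expression(22.0, [18.0, 18.0, 18.0], 24.0, [18.0, 18.0, 18.0])
print(fold)  # 4.0
```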

Primers used were:
cdc-2: 5’-AGCCATTCTGGCCGCTCTCG-3’ and 5’-GCAACCGCCTTCTCGTTTGGC-3’;
pmp-3: 5’-TTTGTGTCAATTGGTCATCG-3’ and 5’-CTGTGTCAATGTCGTGAAGG-3’;
panactin: 5’-TCGGTATGGGACAGAAGGAC-3’ and 5’-CATCCCAGTTGGTGACGATA-3’;
sod-1: 5’-AAAATGTGGAACCGTGCTG-3’ and 5’-TGAACGTGGAATCCATGAA-3’;
sod-2: 5’-GATTTGGAGCCTGTAATCAGTC-3’ and 5’-GAAGAGCGATAGCTTCTTTGAC-3’;
sod-3: 5’-CACTATTAAGCGCGACTTCGG-3’ and 5’-CAATATCCCAACCATCCCCAG-3’;
ctl-2: 5’-ATCCCAACATGATCTTTGA-3’ and 5’-TGAGATTCTTCACTGGTTG-3’;
prdx-2: 5’-CGACTCTGTCTTCTCTCAC-3’ and 5’-GAAGATCATTGATGGTGAT-3’;
aak-2: 5’-AAGTCTGGAGTTGGGAATACG-3’ and 5’-GTATGCACTTCTTTGTGGAACC-3’;
hsf-1: 5’-TCCGTATAAGAATGCGACTAGG-3’ and 5’-TAGCTTCTGATGTGGTTGAAGG-3’;
hsp-1: 5’-GGACGTCTTTCCAAGGATGA-3’ and 5’-TCAAGATCTCGTCGACTTG-3’;
hsp-16.2: 5’-CTGTGAGACGTTGAGATTGATG-3’ and 5’-CTTTACCACTATTTCCGTCCAG-3’;
ash-2: 5’-CGATCGAAACACGGAACGA-3’ and 5’-TGCCGGAATCTGCAGTTTTT-3’;
set-2: 5’-TCGAAGATTGAAGGTGAAGAGAG-3’ and 5’-ATCATCTTTTTGCGGAACTGTAA-3’;
HSPD1: 5’-TGCTGAGTTTTGAATGAGCAA-3’ and 5’-CAATCTGCTCTCAAATGGACA-3’;
HSP90AA1: 5’-GAAATCTGTAGAACCCAAATTTCAA-3’ and 5’-TCTTTGGATACCTAATGCGACA-3’;
luciferase: 5’-ACGTCTTCCCGACGATGA-3’ and 5’-GTCTTTCCGTGCTCCAAAAC-3’.


RNA-sequencing analysis 

Total RNA from four biological replicates of worms sorted at the L2 
stage (extracted as described above) was assessed for quality using 
the TapeStation (Agilent). Samples were prepared using the Illumina 
TruSeq Stranded Total RNA Library Prep kit (Illumina). Total RNA 
(100 ng) was rRNA-depleted using Ribo-Gone (Takara Bio). The rRNA- 
depleted RNA was then fragmented and copied into first strand cDNA 
using reverse transcriptase and random primers. The products were 
purified and enriched by PCR (15 cycles) to create the final cDNA library. 
The 3′ ends of the cDNA were adenylated and ligated to adapters,
including a 6-nt barcode unique for each sample. Final libraries
were checked for quality and quantity by TapeStation and qPCR using
Kapa’s library quantification kit for Illumina Sequencing platforms
(Kapa Biosystems). The samples were pooled, clustered on an Illumina
cBot and sequenced on one lane of an Illumina HiSeq4000 flow cell, 
as paired-end 50-nt reads. The quality of the raw reads data for each 
sample (for example, low-quality scores, over-represented sequences, 
inappropriate GC content) was checked using FastQC (v.0.11.3). The 
Tuxedo Suite software package was used for the computational analysis 
of the RNA sequencing. In brief, reads were aligned to the reference
genome WS220 using TopHat (v.2.0.13) and Bowtie2 (v.2.2.1). Cufflinks/
CuffDiff (v.2.1.1) was used for expression quantitation, normalization,
and differential expression analysis, using reference genome WS220.
For this analysis, we used parameter settings: “--multi-read-correct”
to adjust expression calculations for reads that map in more than one
locus, as well as “--compatible-hits-norm” and “--upper-quartile-norm”
for normalization of expression values. Diagnostic plots were generated
using the CummeRbund R package. Genes and transcripts were
identified as being differentially expressed based on three criteria:
test status = “OK”, FDR < 0.05, and fold change ≥ ±1.5. The Bioconductor
Package GSA was used to perform enrichment test analysis. The
algorithm was modified from the original gene set enrichment analysis 
(GSEA), for better power. Gene sets were downloaded from sources
indicated in Supplementary Table 1. All FDR corrected P values in this 
result are extremely significant (FDR = 0). 
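
The three criteria for calling a gene differentially expressed can be expressed as a simple filter. The sketch below is illustrative rather than the pipeline used in the study: the record keys (status, q_value, log2_fold_change) are hypothetical names chosen to resemble CuffDiff output columns, the example records are invented, and the fold-change cutoff of 1.5 in either direction is applied on the log2 scale.

```python
from math import log2


def is_differentially_expressed(row, fdr_cutoff=0.05, fold_cutoff=1.5):
    """Apply the three criteria: test status OK, FDR below the cutoff and
    an absolute fold change of at least fold_cutoff (up or down)."""
    return (
        row["status"] == "OK"
        and row["q_value"] < fdr_cutoff
        and abs(row["log2_fold_change"]) >= log2(fold_cutoff)
    )


# Hypothetical records with placeholder gene names.
results = [
    {"gene": "geneA", "status": "OK", "q_value": 0.001, "log2_fold_change": 1.2},
    {"gene": "geneB", "status": "OK", "q_value": 0.001, "log2_fold_change": -0.9},
    {"gene": "geneC", "status": "OK", "q_value": 0.20, "log2_fold_change": 2.0},
    {"gene": "geneD", "status": "NOTEST", "q_value": 0.001, "log2_fold_change": 2.0},
    {"gene": "geneE", "status": "OK", "q_value": 0.001, "log2_fold_change": 0.3},
]

degs = [row["gene"] for row in results if is_differentially_expressed(row)]
up = [row["gene"] for row in results
      if is_differentially_expressed(row) and row["log2_fold_change"] > 0]
print(degs, up)
```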


Western blot 

Standard methods for western blotting were used for the detection 
of proteins from worm lysates. In brief, 3,000-5,000 L2-staged worms 
were collected in 20 µl of M9 buffer and snap frozen in liquid nitrogen.
Laemmli loading buffer was added to the worm pellet (1:1 volume) 
and the samples were boiled for 5 min, separated by SDS-PAGE and 
transferred to PVDF membranes. Blots were blocked for 1 h with 5%
milk in PBS and probed with anti-H3 (Abcam, ab1791; 1:2,000), anti-H3K4me3
(Abcam, ab8580; 1:1,000), anti-H3K27ac (Abcam, ab4729;
1:1,000), anti-H3K27me3 (Millipore, 07-449; 1:1,000), anti-ASH-2
(Abmart, X3-G5EFZ3; 1:1,000) or anti-β-tubulin (Santa Cruz, sc-5274;
1:2,000) primary antibodies overnight at 4 °C. For the extraction of
mammalian proteins, HeLa cells were treated with trypsin (Invitrogen, 
25200056), washed twice with PBS and collected in lysis buffer (RIPA
buffer, 1 mM PMSF, protease inhibitor cocktail and 1 mM EDTA; 0.5
ml lysis buffer per 5 x 10° cells). Samples were incubated for 45 min at 
4 °C with constant agitation. Lysates were spun down (4 °C, 20 min, 
12,000 rpm) and snap frozen in liquid nitrogen. Laemmli loading buffer
was added to the lysates (1:1 volume) and the samples were boiled for 


5 min, separated by SDS-PAGE and transferred to PVDF membranes. 
Blots were blocked for 1 h with 5% milk in PBS and probed with anti-H3
(Abcam, ab1791; 1:2,000), anti-H3K4me3 (Abcam, ab8580; 1:1,000),
anti-ASH2L (Bethyl laboratories, polyclonal, A300-489A; 1:1,000),
anti-MLL1 (Bethyl laboratories, polyclonal, A300-374A; 1:500) or anti-β-tubulin
(Santa Cruz, sc-5274; 1:2,500) primary antibodies overnight
at 4 °C. HRP conjugated anti-rabbit (ThermoScientific, 31460) and anti-mouse
(ThermoScientific, 31430) secondary antibodies were used at
1:5,000 dilution for 1 h at room temperature. Proteins were detected
using Clarity ECL western blotting substrate (BioRad) and signal was 
captured using a BioRad ChemiDoc Touch imaging system. 


C. elegans RNAi 

Escherichia coli HT115 (DE3) strains transformed with vectors expressing
dsRNA of the genes of interest (ash-2, set-2, wdr-5.1, empty pL4440)
were obtained from the Ahringer library (a gift from G. Csankovszki), 
sequence-verified and grown at 37 °C as per manufacturer’s recom- 
mendations. L1 worms obtained from synchronized populations were 
placed onto NGM plates containing ampicillin (100 mg ml^-1) and IPTG
(0.4 mM) seeded with the respective bacteria. Worms were cultivated
on either RNAi or the empty vector control bacteria for two generations.


Mammalian cell culture and H2O2 treatment

HeLa (EM-2-11ht) cells (a gift from J. Nandakumar and authenticated 
by STR) were cultured in DMEM (Life Technologies, 11995-065), sup- 
plemented with 10% Fetal Bovine Serum (Sigma-Aldrich, F4135) and 1% 
penicillin-streptomycin (Gibco, 15140-122) at 5% CO2. At 80% confluency,
cells were washed with PBS (Life Technologies, 10010023) and
treated with HBSS (Life Technologies, 14025-092) supplemented with
0.1 mM or 0.3 mM H2O2 and incubated at 37 °C for 30 min.


Mammalian siRNA and heat-shock treatment 

Eight thousand cells were transfected with 4.8 pmol of ASH2L siRNA 
(Dharmacon, M-019831-01-0005) or non-targeting siRNA (Dharmacon, 
D-001210-02-05) using Lipofectamine RNAiMax (Invitrogen, 13778- 
150) in OPTI-MEM I Reduced serum medium (Gibco, 31985-062). For 
heat-shock treatment, cells were washed with PBS 72 h after siRNA 
transfection and placed in HBSS. Plates were wrapped with parafilm 
and submerged in a pre-heated water bath at 43 °C for the indicated 
time points. Viability was determined using the CellTiter-Glo Kit (Pro- 
mega) as per manufacturer’s recommendations. Luminescence was 
monitored on a FLUOstar Omega microplate reader (BMG Labtech). 


Histone methyltransferase activity assays 

SET domains of SET1/MLL family proteins (MLL1, SET1A, SET1B), RBBP5 
(full length), ASH2L (full length), and WDR5 (full length) were purified 
as previously described*. The purified proteins were diluted to 10 μM 
and incubated with 1 mM or 2 mM H₂O₂ at 4 °C for one hour in the buffer 
25 mM Tris-HCl, pH 8.0. For DTT recovery, 4 mM DTT was added after 
H₂O₂ treatment and was incubated at 4 °C for 60 min. After oxidation, 
excess H₂O₂ was removed by ultrafiltration. Methyltransferase assays 
were performed using H3 peptides (residues 1-20) with one additional 
Tyr residue at the C terminus for accurate quantification of peptides. 
An enzyme-coupled continuous spectrophotometric assay system 
was employed to monitor the time course of the reaction³³. This 
assay system, which monitors the appearance of the cofactor product 
(SAH) at an absorbance of 515 nm (that is, OD₅₁₅), contained the following 
components: 25 mM Tris (pH 8.0), 320 nM AdoHcy nucleosidase, 
480 nM adenine deaminase, 40 U l⁻¹ xanthine oxidase, 20,000 U l⁻¹ 
horseradish peroxidase, 4.5 mM 3,5-dichloro-2-hydroxybenzenesulfonic 
acid, 0.894 mM 4-aminophenazone, 40 μM MnCl₂, 2.25 μM 
K₄Fe(CN)₆·3H₂O, 200 μM S-adenosyl-methionine and 1 μM of the four 
mammalian proteins that constitute the minimal H3K4-methylating 
complex. All components were mixed in a 30 μl volume in a 384-well plate 
at room temperature, and the reaction was initiated by adding 400 μM 
H3 peptide substrate. The OD₅₁₅ was monitored using a Synergy Neo 
Multi-Mode Reader (Bio-Tek) for 1 h at 28 °C. The slope of OD₅₁₅ versus 
time from the first 20 min linear range was converted into reaction 
rates. The relative activity for each complex without any H₂O₂ pretreatment 
was set to 1. A buffer control was used to determine the baseline. 
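
The rate calculation described above can be sketched as follows. This is a minimal illustration, not the authors' analysis script: the function names, the least-squares fit and the way the buffer baseline is subtracted are all assumptions, and the readings used with it would be the plate-reader OD at 515 nm.

```python
def linear_slope(times_min, od_values):
    """Ordinary least-squares slope of OD versus time (OD units per minute)."""
    n = len(times_min)
    mean_t = sum(times_min) / n
    mean_od = sum(od_values) / n
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    sxy = sum((t - mean_t) * (od - mean_od)
              for t, od in zip(times_min, od_values))
    return sxy / sxx


def initial_rate(times_min, od_values, window_min=20.0):
    """Slope over the first `window_min` minutes, taken here as the linear range."""
    points = [(t, od) for t, od in zip(times_min, od_values) if t <= window_min]
    ts, ods = zip(*points)
    return linear_slope(ts, ods)


def relative_activity(rate_treated, rate_untreated, rate_buffer=0.0):
    """Activity relative to the untreated complex (set to 1).

    Subtracting the buffer-control slope from both rates is an assumed way
    of applying the baseline; the text does not spell out the exact step.
    """
    return (rate_treated - rate_buffer) / (rate_untreated - rate_buffer)
```

For example, `relative_activity(initial_rate(t, od_treated), initial_rate(t, od_untreated))` gives the fraction of activity retained after a pretreatment, with both traces read on the same time grid `t`.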


In vitro protein oxidation and thiol trapping 

Purified MLL1 SET domain (15 μM) was treated with either 2 mM DTT or 
2 mM H₂O₂ for 30 min at 4 °C or 30 °C. To stop the reaction, the 
H₂O₂-treated samples were mixed with catalase (0.5 mg ml⁻¹). To reduce 
reversible thiol modifications, the oxidized samples were treated with 
4 mM DTT for 30 min at 30 °C. The reduced cysteines were blocked 
with 20 mM NEM (N-ethylmaleimide) before SDS-PAGE analysis. For 
reverse thiol trapping experiments, the samples were resuspended in 
a denaturing thiol-trapping buffer (2.3 M urea, 0.2% SDS, 10 mM EDTA, 
200 mM Tris-HCl, pH 8.5) supplemented with 20 mM NEM for 30 min 
at 25 °C. Proteins were precipitated with 10% trichloroacetic acid (TCA). 
After centrifugation, the pellets were washed with 10% TCA and 5% 
TCA and re-dissolved in the denaturing thiol-trapping buffer 
supplemented with 4 mM DTT to reduce reversible thiol modifications. After 
45 min of incubation at 30 °C, all new cysteine thiols were labelled with 
25 mM AMS for 5 min at 25 °C. Proteins were analysed on SDS-PAGE 
under non-reducing conditions and visualized using silver staining. For 
the mass spectrometry analysis of the cysteine-containing peptides, 
iodoacetamide (IAM) was used instead of AMS to label reversibly 
oxidized cysteines. After SDS-PAGE under non-reducing conditions and 
Coomassie staining, protein bands were cut out, trypsin-digested, and 
analysed by nano liquid chromatography with tandem mass spectrometry 
(LC-MS/MS; MS Bioworks). 


Statistical analysis 

The Prism software package (GraphPad Software 7) and the Microsoft 
Office 2010 Excel software package (Microsoft Corporation) were used 
to carry out statistical analyses. Information about statistical tests, 
P values and n numbers are provided in the respective figures and 
figure legends. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All relevant data are available and/or included with the manuscript as 
Source Data or Supplementary Information. RNA-sequencing data 
have been uploaded to the Gene Expression Omnibus (GEO) database 
with accession number GSE138502. 


26. Brenner, S. The genetics of Caenorhabditis elegans. Genetics 77, 71-94 (1974). 

27. Stroustrup, N. et al. The Caenorhabditis elegans Lifespan Machine. Nat. Methods 10, 
665-670 (2013). 

28. Koopman, M. et al. A screening-based platform for the assessment of cellular respiration 
in Caenorhabditis elegans. Nat. Protocols 11, 1798-1816 (2016). 

29. Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient 
alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009). 

30. Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA- 
Seq. Bioinformatics 25, 1105-1111 (2009). 

31. Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach 
for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. USA 102, 
15545-15550 (2005). 

32. Ng, S. B. et al. Exome sequencing identifies MLL2 mutations as a cause of Kabuki 
syndrome. Nat. Genet. 42, 790-793 (2010). 

33. Dorgan, K. M. et al. An enzyme-coupled continuous spectrophotometric assay for 
S-adenosylmethionine-dependent methyltransferases. Anal. Biochem. 350, 249-255 
(2006). 

34. Southall, S. M., Cronin, N. B. & Wilson, J. R. A novel route to product specificity in the 
Suv4-20 family of histone H4K20 methyltransferases. Nucleic Acids Res. 42, 661-671 
(2014). 


Acknowledgements We thank M. Malinouski and T. Mullins for assistance with the 
reconfiguration of the Biosorter; G. Csankovszki for antibodies, C. elegans RNAi feeding clones 
and comments; B. Braeckman for the N2jrIs2[Prpl-17::Grx1-roGFP2] strain; J. Nandakumar for 
HeLa (EM-2-11ht) cells; the Caenorhabditis Genetics Center (funded by National Institutes of 
Health Infrastructure Program P40 OD010440) for C. elegans strains; the DNA Sequencing 
Core (BRCF), R. Tagett, W. Wu and the Bioinformatics Core of University of Michigan for RNA 
sequencing and data analysis; K. Wan for protein purification; R. Sawarkar and J. Labbadia for 
important suggestions; Jakob laboratory members for comments on the manuscript and 
J. Bardwell for critically reading the manuscript. Mass spectrometry was performed by MS 
Bioworks. This work was supported by NIH grants GM122506 and AG046799 as well as the 
Priority Program SPP 1710 of the Deutsche Forschungsgemeinschaft (Schw823/3-2) to U.J., a 
NIH T32 Career Training in the Biology of Aging grant to D.B., a NIH T32 Career Training in the 
Biology of Aging grant and a Bright Focus ADR Fellowship (A2019250F) to B.J.O., and the 
National Natural Science Foundation of China (31470737) to Y.C. 


Author contributions D.B. conceived and conducted most experiments, performed data 
analysis and wrote the manuscript; D.K. conceived experiments and initiated work with the 
BioSorter; Y.Z. performed the in vitro methyltransferase assays; K.U. performed the reverse 
thiol trapping and prepared samples for mass spectrometry analysis; B.J.O. assisted with RNAi 
experiments and western blots for methylation marks in C. elegans; L.X. performed siRNA in 
HeLa cells; M.K. built and operated the lifespan instrument; A.K. assisted with worm sorting 
and produced brood size data; Y.-T.L. purified mammalian proteins; Y.D. conceived 
experiments and provided material; S.Q. and Y.C. conceived experiments; U.J. conceived 
experiments, conducted data analysis and wrote the manuscript. 


Competing interests The authors declare no competing interests. 


Additional information 

Supplementary information is available for this paper at https://doi.org/10.1038/s41586-019-1814-y. 

Correspondence and requests for materials should be addressed to U.J. 

Peer review information Nature thanks Anne Brunet, Michael Ristow and the other, 
anonymous, reviewer(s) for their contribution to the peer review of this work. 

Reprints and permissions information is available at http://www.nature.com/reprints. 


[Extended Data Fig. 1 image, panels a-c. Axis labels: roGFP2-Grx1 ratio; fluorescent signal; axial length (time of flight).] 


Extended Data Fig. 1 | In vivo readout of endogenous redox states at 
different stages during C. elegans lifespan and sorting parameters of 
oxidized and reduced subpopulations. a, Microscopy analysis of the 
Grx1-roGFP2 ratio of individual N2jrIs2[Prpl-17::Grx1-roGFP2] worms (symbol) 
cultivated at 15 °C and imaged at the indicated time points. Data points that are 
not significantly different from each other (P > 0.05) are labelled with the same 
letter. Data are mean ± s.e.m.; n, number of worms; one-way ANOVA with Tukey 
correction. b, The Grx1-roGFP2 ratio (A₄₀₅/A₄₈₈) was calculated using the partial 
profiling feature (pp) configured to analyse extinction and emission data from 
488 nm and 405 nm lasers that sequentially excited each worm. c, A population 
of N2jrIs2[Prpl-17::Grx1-roGFP2] at the L2 stage, separated based on opacity 
(extinction) and length (time of flight), was gated as R1. 
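
The ratio and gating logic in this legend can be sketched as below. This is an illustration only: the `WormEvent` record, its field names and the rectangular gate are hypothetical stand-ins (instrument gates such as R1 are drawn in the sorter software and need not be rectangular), and no FlowPilot API is implied.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class WormEvent:
    """One detected object; all field names are hypothetical."""
    time_of_flight: float  # proxy for axial length
    extinction: float      # proxy for opacity
    emission_405: float    # emission after 405 nm excitation
    emission_488: float    # emission after 488 nm excitation


def grx1_rogfp2_ratio(event: WormEvent) -> float:
    """405/488 excitation ratio; higher values indicate a more oxidized sensor."""
    return event.emission_405 / event.emission_488


def in_gate(event: WormEvent,
            tof_range: Tuple[float, float],
            ext_range: Tuple[float, float]) -> bool:
    """Rectangular gate on length (time of flight) and opacity (extinction)."""
    return (tof_range[0] <= event.time_of_flight <= tof_range[1]
            and ext_range[0] <= event.extinction <= ext_range[1])


def gated_ratios(events: List[WormEvent],
                 tof_range: Tuple[float, float],
                 ext_range: Tuple[float, float]) -> List[float]:
    """Ratios of only those events that fall inside the gate."""
    return [grx1_rogfp2_ratio(e) for e in events
            if in_gate(e, tof_range, ext_range)]
```

Gating first on size and opacity keeps debris and worms of other stages out of the ratio distribution, which is the purpose of the R1 gate described in panel c.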
Extended Data Fig. 2 | Sorting efficiency and lifespan of L2ox and L2red 
subpopulations. a-d, Microscopy analysis of the Grx1-roGFP2 ratio of 
individual worms (dots) previously sorted into L2ox, L2mean and L2red 
subpopulations. n, number of worms; one-way ANOVA with Tukey correction. 
P values (log-rank test) see Extended Data Table 2. Insets, Grx1-roGFP2 ratio of 
individual worms (dots), assessed by fluorescence microscopy after sorting. 
n, number of worms. For P values (two-sided unpaired t-test) see Extended Data 
Table 2. 
Extended Data Fig. 3 | Physiological properties of L2ox and L2red sorted 
worms. a, Length measurements of L2ox and L2red worms (symbol) from nose to 
tail tip immediately after sorting. No significant difference; P = 0.4735 
(unpaired two-sided t-test). b, Brood size of L2ox and L2red worms, measured at 
the indicated time points. n, number of worms. No significant difference within 
a single age; P = 0.6532 (two-way ANOVA). c-f, Basal respiration (c), maximal (d) 
and spare (e) respiratory capacity and basal rates of flux through glycolysis (f) 
of L2ox and L2red worms. n = 3 independent sorting experiments. ECAR, 
extracellular acidification rate; OCR, oxygen consumption rate. P = 0.9469 (c), 
P = 0.7784 (d), P = 0.7904 (e) and P = 0.7925 (f); two-sided unpaired t-test. 
g, Survival of L2ox, L2mean and L2red worms 20 h after heat shock. n = 5 
independent sorting experiments; two-sided unpaired t-test. The connected 
data points represent data from the same sorting experiment. The survival of 
L2ox is set to 1. All data are mean ± s.e.m. 
Extended Data Fig. 4 | Gene expression profiles of L2ox and L2red. a, Steady-state 
transcript levels of selected oxidative stress-related genes in L2ox and L2red 
worms. n, number of independent sorting experiments; unpaired two-sided 
t-test. Data are mean ± s.e.m. b, Volcano plot showing fold changes versus 
P values for the transcriptomes of L2ox and L2red subpopulations. DEGs 
(P < 0.05) are represented by red dots (see Methods for statistical definition 
of DEGs). Data were collected from four independent sorting experiments. 
c, GSEA of the 327 DEGs. Normalized enrichment scores (see Methods for 
calculation) are represented by the bar graph. Terms (for summary, see 
Supplementary Table 1) indicating origin, process or phenotype associated 
with genes known to have a role in the process are shown on the left. Some 
terms (*) have been merged and are represented as a single category bar for 
simplicity (for detailed values, see Supplementary Table 1). d, e, Percentage of 
DEGs identified in L2ox that intersect with H3K4me3 peak signals within their 5′ 
region (500 bp upstream and downstream from the transcription start site). 
The H3K4me3 chromatin immunoprecipitation (ChIP) datasets were generated 
from L3-staged N2 worms: ChIP-chip, GEO entry GSE30789 in d and chromatin 
immunoprecipitation followed by sequencing (ChIP-seq), GEO entry 
GSE28770 in e, indicating that these marks are set during larval development. 
Hypergeometric probability: d, P = 0.064; e, P=2.786 10°. In f, g, Venn 
diagrams show the overlap among upregulated (f) or downregulated (g) gene 
sets in L2ox and downregulated or upregulated set-9(rw5) and set-26(tm2467) 
gene sets (GEO entry: GSE100623). See Supplementary Table 1 for datasets 
in d-g. 
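
A hypergeometric overlap probability of the kind quoted for panels d and e can be computed as sketched below. This is a generic illustration under stated assumptions: the legend does not say which tail was used or what gene universe was chosen, so the upper-tail form and every input value here are hypothetical.

```python
from math import comb


def hypergeom_upper_tail(overlap, universe, marked, drawn):
    """P(X >= overlap) for a hypergeometric draw.

    universe: total number of genes considered (assumed input)
    marked:   genes in the universe carrying an H3K4me3 peak
    drawn:    number of DEGs tested (for example, 327)
    overlap:  DEGs that carry a peak
    """
    upper = min(marked, drawn)
    total = comb(universe, drawn)
    favourable = sum(
        comb(marked, i) * comb(universe - marked, drawn - i)
        for i in range(overlap, upper + 1)
    )
    return favourable / total
```

A small value means that the observed overlap between DEGs and H3K4me3-marked genes is larger than expected if the DEGs were drawn at random from the chosen universe.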
Extended Data Fig. 5 | Redox sensitivity of in vivo H3K4me3 levels and 
in vitro histone methyltransferase complex activity. a, Global H3K4me3 
levels in the sorted L2ox and L2red worms. A representative western blot using 
antibodies against H3K4me3 is shown. b, c, Quantification of global H3K27ac (b) 
and H3K27me3 (c) levels by western blot. n = 3 independent sorting 
experiments. P = 0.3793 (b) and P = 0.0905 (c); unpaired two-sided t-test. Data 
represent mean ± s.e.m. d-f, Global H3K4me3 (d), ASH2L (e) and MLL1 (f) levels 
in HeLa cells before and after H₂O₂ treatment, as assessed by western blot. 
g, Time course of the in vitro methyltransferase reaction for core COMPASS 
subunits (SET domain of MLL1-WDR5-ASH2L-RBBP5). Reaction rates were 
derived from the first 20 min of the linear range. h-j, In vitro histone 
methyltransferase assays of core COMPASS subunits, consisting of purified 
GST-WDR5 (WDR5), GST-ASH2L (ASH2L), GST-RBBP5 (RBBP5) and either 
GST-MLL1 SET domain or untagged MLL1 SET domain (h), GST-SET1A SET domain (i) 
or GST-SET1B SET domain (j). Superscript ox indicates that the protein was 
pre-treated with either 1 mM (+) or 2 mM (++) H₂O₂ for 30 min before the activity 
assay. DTT was added after the H₂O₂ treatment. n = 3 independent experiments; 
one-way ANOVA with Sidak correction. Data are mean ± s.e.m. k, The MLL1 SET 
domain was treated with either 2 mM DTT, 2 mM H₂O₂ or 2 mM H₂O₂, followed by 
4 mM DTT. Catalase was used to quench the H₂O₂. The proteins were denatured 
and thiols were modified with NEM before loading onto non-reducing 
SDS-PAGE to prevent non-specific thiol oxidation. The proteins were visualized by 
silver staining. M, marker. l, MLL1 SET domain treated with either 2 mM DTT, 
2 mM H₂O₂ or 2 mM H₂O₂, followed by 4 mM DTT. All reduced protein thiols 
were then labelled with the 500-Da thiol-reactive compound AMS, causing a 
500-Da mass decrease per oxidized thiol, detectable on reducing SDS-PAGE. 
M, protein marker. m, Cysteine oxidation state in MLL1 SET domain after 
treatment with either 2 mM DTT or 2 mM H₂O₂ followed by NEM labelling as 
assessed by LC-MS/MS. The peptide containing Cys3967 could not be 
detected. n, Schematic representation of the redox sensitivity of the MLL1 SET 
domain. For blot and gel source images, see Supplementary Figs. 1 and 3. 
o, Sequence alignment of the SET domain. All cysteines in MLL1 are shown in 
bold, and the five absolutely conserved cysteines are highlighted in yellow. 
Cysteines shown to be involved in zinc coordination are marked with an 
asterisk. NCBI protein BLAST and Clustal Omega Multiple Sequence 
Alignment, Clustal O (1.2.4) were used. 


[Extended Data Fig. 6 image, panels a-h. Legend entries: control E.V., ash-2 RNAi, set-2 RNAi, wdr-5.1 RNAi; N2, set-2(ok952), wdr-5.1(ok1417); hsf-1, hsp-1, hsp-16.2; non-targeting siRNA, ASH2L siRNA; E.V., ash-2 and set-2, each untreated (untr.) or paraquat-treated (PQ).] 

Extended Data Fig. 6 | Effects of H3K4me3 downregulation on heat-shock 
response and endogenous redox state. a, ASH-2 and SET-2 transcript levels of 
N2jrIs2[Prpl-17::Grx1-roGFP2] worms treated with ash-2 or set-2 RNAi for two 
generations. n = 6 (ash-2) and n = 2 (set-2) independent experiments; 
unpaired two-sided t-test. E.V., empty vector. b, ASH-2 protein levels in 
N2jrIs2[Prpl-17::Grx1-roGFP2] worms treated with control RNAi or ash-2 RNAi 
for two generations using western blot analysis. c, H3K4me3 levels in 
N2jrIs2[Prpl-17::Grx1-roGFP2] worms treated with control RNAi, ash-2 RNAi or 
set-2 RNAi for two generations. d, Transcript levels of selected heat-shock 
genes after heat-shock treatment of N2jrIs2[Prpl-17::Grx1-roGFP2] worms 
treated with the indicated RNAi. n = 3 independent experiments; one-way 
ANOVA with Bonferroni correction. e, Transcript levels of selected heat-shock 
genes in set-2 or wdr-5.1 mutants before and after heat-shock treatment. n = 3 
independent experiments; one-way ANOVA with Bonferroni correction. 
f, ASH2L levels following ASH2L siRNA treatment of HeLa cells. n = 2 
independent experiments. g, Grx1-roGFP2 ratios of L2 larval worms treated 
with ash-2 RNAi, set-2 RNAi, wdr-5.1 RNAi or the empty vector were measured 
using the BioSorter. n = 4 (ash-2, set-2) and n = 3 (wdr-5.1) independent sorting 
experiments; unpaired two-sided t-test. h, Representative survival curves of 
N2jrIs2[Prpl-17::Grx1-roGFP2] worms treated with ash-2 or set-2 RNAi for two 
generations and treated with 1 mM paraquat (PQ) for 10 h at the L2 larval stage. 
For n numbers, repetitions and statistics (log-rank), see Extended Data Table 4. 
Data in a, d-g represent mean ± s.e.m. For blot source images, see 
Supplementary Figs. 1, 3. 


Extended Data Table 1 | Lifespan assays of L2ox, L2mean and L2red subpopulations following heat-shock treatment or in the 
continuous presence of paraquat or juglone 


Dead/ P 


Condition Experiment Sk Mean Max _ censored mae ole 70 on Boe value 
worms (BioS.) 
L2™ 19 32.5 24 7.44+0.01(n=238) 

1 Lamean 17 31.5 19/4 0.639 10.5%* Figure 2a 7.37+0.02(n=135) 0.002 
3 E Lares 13 27.3 32/1 0.0088 31.6%* 7.29+0.02(n=132) 0.2x10°° 
2 2 ; Lax 17.5 25 29/13 7.06+40.01(n=698) 

Is Lard 12 21 20/3 0.0077 31.4%* 7.0140.012(n=319) 0.002 
Lax 18 25.9 96/2 7.1540.007(n=893) 

Lared 15 24.2 7/1 0.0153 16.7%" 7.1240.012(n=914) 0.009 
Lax 17 26 13 ; 7.0940.02(n=116) 

1 L2mean 14 26 11 0.4456 17.6%" Figure 2b 6.99+0.033(n=113) 0.006 
= Lared 10 21 20 0.0215 41.2%* 6.6740.045(n=108) —0.2x10°3 
se Lox 13 18.5 38 7.354£0.017(n=158) 

CN 2 
a Lared 11 18.5 40 0.0456 15.4%" 7.14£0.011(n=277) —0.1x10°79 
Lax 13 18.6 47 6.7140.02(n=139) 
. Lared 12 15.5 38 0.0037 16.7% 6.334£0.06(n=100) 0.8x107 
Lax 23 32.4 72 6.5940.012(n=428) 
e = ; Lared 19 32.4 73 0.0033 17.4%" 6.554£0.012(n=513) 0.007 
28 Lax 18 33.3 86 7.4340.01 (n=186) 
: Lared 18 29.5 59 0.0202 11.4% 7.2740.015(n=161) — 0.1x10°8 


Worms were sorted onto NGM plates and exposed to heat-shock treatment or the indicated compounds. P values for Kaplan-Meier survival analysis were calculated on the basis of the log-rank 
(Mantel-Cox) method. Samples were compared with L2ox subpopulations. Maximum lifespan was defined as the lifespan of the last 10% of the population. 

*Percentage change based on mean lifespan. 

†Percentage change based on maximum lifespan. 

‡Grx1-roGFP2 ratio of L2ox and L2red subpopulations, based on BioSorter analysis during sorting. Data are mean ± s.e.m. n, number of worms. Two-sided unpaired t-test. 
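
The two derived quantities used in these tables can be sketched as follows. This is a minimal illustration, not the authors' calculation: the function names are hypothetical, and taking the mean of the longest-lived 10% is an assumed reading of "the lifespan of the last 10% of the population".

```python
def percent_change(reference_days, sample_days):
    """Absolute change relative to the reference population, in percent."""
    return abs(reference_days - sample_days) / reference_days * 100.0


def maximum_lifespan(death_days, top_percent=10):
    """Mean lifespan of the longest-lived `top_percent` of animals.

    Uses integer arithmetic to round the group size up, so at least one
    animal is always included.
    """
    ordered = sorted(death_days)
    k = max(1, (len(ordered) * top_percent + 99) // 100)
    longest_lived = ordered[-k:]
    return sum(longest_lived) / len(longest_lived)


# Mean lifespans of 19 and 17 days give a 10.5% change, and 19 and 13 days
# give 31.6%, matching two of the percentage entries listed in the table.
print(round(percent_change(19, 17), 1))  # 10.5
print(round(percent_change(19, 13), 1))  # 31.6
```

Whether a given table entry uses mean or maximum lifespan is indicated by its footnote symbol, so the same `percent_change` helper applies to both once the appropriate pair of values is chosen.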
Extended Data Table 2 | Lifespan assays of L2ox, L2mean and L2red subpopulations 


Exp. | Sorted population | Mean | Max | Dead/censored worms | P value (survival) | % change | Figure on text | Ratio BioSorter§ | P value (BioS.) | Ratio Microscope‖ | P value (Micr.) 
1 | L2ox | 30 | 35.8 | 97 | | | Fig. 2c | 6.88±0.02 (n=134) | | 0.49±0.008 (n=36) | 
1 | L2mean | 26 | 33.5 | 74 | 0.007 | 13.3%* | | 6.69±0.02 (n=143) | 0.1x10% | 0.45±0.003 (n=49) | 0.1x10% 
1 | L2red | 25 | 32.3 | 58 | 0.0004 | 17%* | | 6.62±0.04 (n=131) | 0.4x10% | 0.44±0.003 (n=50) | 0.1x10°7 
2 | L2ox | 28 | 35.1 | 78/2 | | | | 7.14±0.02 (n=127) | | 0.96±0.07 (n=7) | 
2 | L2mean | 25 | 34.3 | 105/3 | 0.1018 | 10.7%* | | 7.11±0.03 (n=133) | 0.532 | 0.69±0.01 (n=7) | 0.0027 
2 | L2red | 23 | 33.5 | 63/1 | 0.0118 | 17.8%* | | 6.94±0.07 (n=136) | 0.0101 | 0.66±0.01 (n=10) | 0.0002 
3 | L2ox | 21 | 34 | 94/2 | | | | 7.45±0.02 (n=139) | | 0.51±0.03 (n=14) | 
3 | L2red | 20 | 30 | 51 | 0.0088 | 12% | | 7.35±0.01 (n=716) | 0.0003 | 0.45±0.01 (n=13) | 0.0036 
4 | L2ox | 31 | 39.4 | 162/3 | | | | 6.61±0.02 (n=170) | | 0.6±0.02 (n=48) | 
4 | L2red | 28.5 | 38.8 | 120 | 0.042 | 8%* | | 6.55±0.02 (n=161) | 0.0456 | 0.54±0.01 (n=44) | 0.0037 
5 | L2ox | 28 | 36 | 114/4 | | | | 7.29±0.01 (n=191) | | 0.48±0.03 (n=10) | 
5 | L2red | 26 | 35.4 | 106 | 0.0362 | 7%* | | 7.12±0.02 (n=88) | 0.3x10° | 0.39±0.004 (n=14) | 0.0063 
6¶ | L2ox | 24 | 34 | 164/7 | | | | 6.19±0.02 (n=275) | | 1.68±0.01 (n=9) | 
6¶ | L2red | 23 | 34 | 155/9 | 0.0862‡ | 4%* | | 6.03±0.02 (n=273) | 0.5x107 | 1.65±0.01 (n=8) | 0.0583 
7 | E.V.ox | 24 | 34 | 46 | | | Fig. 4d | 5.55±0.04 (n=114) | | | 
7 | E.V.red | 21 | 28.3 | 63 | 0.0004 | 12.5%* | | 4.49±0.067 (n=114) | 0.5x10°8 | | 
8 | E.V.ox | 24 | 33 | 77/3 | | | Fig. 4e | 6.37±0.035 (n=60) | | | 
8 | E.V.red | 20 | 31 | 80 | 0.002 | 17%* | | 5.85±0.076 (n=63) | 0.1x107 | | 
9 | E.V.ox | 20 | 33 | 85 | | | | 6.08±0.046 (n=133) | | | 
9 | E.V.red | 17 | 30.7 | 77 | 0.0264 | 15%* | | 5.87±0.052 (n=107) | 0.0032 | | 
10 | E.V.ox | 18 | 28.7 | 89/1 | | | | 6.03±0.022 (n=225) | | | 
10 | E.V.red | 18 | 25.4 | 85 | 0.0077 | 11.5%† | | 5.10±0.038 (n=226) | 0.3x10°® | | 

Worms were sorted onto NGM plates and assessed for lifespan. P values for Kaplan-Meier survival analysis were calculated on the basis of the log-rank (Mantel-Cox) method unless noted 
otherwise. Samples were compared with L2ox subpopulations. Maximum lifespan was defined as the lifespan of the last 10% of the population. E.V., empty vector (control RNAi conditions for 
experiments in Extended Data Table 4). 

*Percentage change based on mean lifespan. 

†Percentage change based on maximum lifespan. 

‡P value for survival analysis calculated based on the Gehan-Breslow-Wilcoxon method. 

§Grx1-roGFP2 ratio of L2ox and L2red subpopulations, based on BioSorter analysis during sorting. Data represent mean ± s.e.m. n, number of worms. Two-sided unpaired t-test. 

‖Grx1-roGFP2 ratio of L2ox and L2red subpopulations based on microscopy analysis, following sorting. Data are mean ± s.e.m. n, number of worms. Two-sided unpaired t-test. 

¶Lifespan assay conducted with experimenter blind to the type of worms assayed. 
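
The log-rank (Mantel-Cox) comparison named in these notes can be sketched in a few lines. This is a textbook two-group implementation for illustration, not the Prism routine used by the authors; the function name and the input layout (one lifespan and one death/censoring flag per worm) are assumptions.

```python
import math


def logrank_test(times_a, events_a, times_b, events_b):
    """Two-group log-rank test; returns (chi-square, two-sided P value).

    times_*:  day of death or censoring for each animal
    events_*: True if the animal died, False if it was censored
    """
    data = ([(t, e, 0) for t, e in zip(times_a, events_a)]
            + [(t, e, 1) for t, e in zip(times_b, events_b)])
    death_times = sorted({t for t, e, _ in data if e})

    observed_a = expected_a = variance = 0.0
    for t in death_times:
        at_risk_a = sum(1 for tt, _, g in data if tt >= t and g == 0)
        at_risk_b = sum(1 for tt, _, g in data if tt >= t and g == 1)
        deaths_a = sum(1 for tt, e, g in data if tt == t and e and g == 0)
        deaths_b = sum(1 for tt, e, g in data if tt == t and e and g == 1)
        n = at_risk_a + at_risk_b
        d = deaths_a + deaths_b
        observed_a += deaths_a
        expected_a += d * at_risk_a / n
        if n > 1:
            # Hypergeometric variance of the deaths in group A at this time.
            variance += d * (at_risk_a / n) * (at_risk_b / n) * (n - d) / (n - 1)

    if variance == 0.0:
        return 0.0, 1.0
    chi2 = (observed_a - expected_a) ** 2 / variance
    # Survival function of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value
```

The Gehan-Breslow-Wilcoxon method mentioned in the notes differs by weighting each death time by the number of animals still at risk, which puts more emphasis on early deaths; that variant is not implemented here.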


Extended Data Table 3 | Lifespan assays of L2ox and L2red subpopulations following transient exposure to oxidizing or 
reducing conditions 


Dead/ Ratio P 


Experiment Sorted population Mean Max ping Pvalue %change _ Figure on text BioSorterS hess 
L2™* 24 34.4 91 7.0740.018(n=317) 
1mM PQ 26 33.7 85 0.4364 7.7%* Figure 2f 
10 mM NAC 19 32.9 99 0.008 20.8%* 
: Lard 21 31.2 109 7.40.011(n=402) 0.00074 
1 mM PQ 23 31.7 139/1 0.0261 8.7%* Figure 2e 
10 mM NAC 21 29.4 157 0.0546 - 
Lax 16 23.3 86 7.09+0.007(n=800) 
2 L2red 15 19 54 0.0002 18.4%t 7.05+0.009(n=1218) 0.00034 
L2¢+1mMPQ 15.5 25.6 64 0.0002! 9%t 
Lax 23 29.8 89 7 
: 10 mM NAC 21 30.4 50 0.05* 8.7%* 
L2Mixed 15 21.1 66 
0.1 mM PQ 15 20.6 107 0.5137 2.4%t 
4i Figure 2d 
1mMPQ 15 27.8 96 0.0175 24.1%1 
10 mM NAC 14.5 19.5 66 0.3667 7.6%" 
L2Mved 12 16 84 
‘ 0.1 mM PQ 12 20.7 101 0.003 22.7%! 
. 1mMPQ 12 29.7 88 0.0023 46.1% 
10 mM NAC 11 18.7 101 0.1809 14.4%" 


Worms were sorted into liquid medium, exposed to paraquat or NAC for 10 h and then returned to NGM plates for lifespan assessment. P values for Kaplan-Meier survival analysis were 
calculated on the basis of the log-rank (Mantel-Cox) method unless noted otherwise. Samples were compared with L2ox subpopulations or L2mixed populations. Maximum lifespan was 
defined as the lifespan of the last 10% of the population. 

*Percentage change based on mean lifespan. 

†Percentage change based on maximum lifespan. 

‡P value for survival analysis calculated based on the Gehan-Breslow-Wilcoxon method. 

§Grx1-roGFP2 ratio of L2ox and L2red subpopulations at the L2 stage, based on BioSorter analysis during sorting. Data are mean ± s.e.m. n, number of worms. Two-sided unpaired t-test. 

‖Sample compared with the reduced subpopulation. 

¶Lifespan assays performed in the lifespan machine (see Methods). 
Extended Data Table 4 | Lifespan assays upon H3K4me3-targeting RNAi treatment 


Exp. | Sorted population | Mean | Max | Dead/censored worms | P value | % change | Figure on text | Ratio BioSorter‡ | P value (BioS.) 
1& | ash-2 RNAi ox | 24 | 33.3 | 40/2 | | | Figure 4d | 5.00±0.043 (n=95) | 
1& | ash-2 RNAi red | 25 | 34 | 45/4 | 0.1373 | 4.2%* | | 4.66±0.046 (n=85) | 0.1x10°% 
2& | set-2 RNAi ox | 22 | 32 | 57/1 | | | Figure 4e | 6.17±0.047 (n=52) | 
2& | set-2 RNAi red | 22 | 30 | 72/1 | 0.7739 | - | | 5.64±0.03 (n=224) | 0.2x10°"4 
3 | ash-2 RNAi ox | 30 | 39.1 | 70 | | | | 6.12±0.03 (n=186) | 
3 | ash-2 RNAi red | 30 | 37.1 | 79 | 0.5721 | - | | 5.8±0.039 (n=216) | 0.9x10°° 
3 | set-2 RNAi ox | 20 | 34 | 76 | | | | 6.13±0.019 (n=259) | 
3 | set-2 RNAi red | 20 | 33 | 55 | 0.7339 | - | | 5.67±0.026 (n=295) | 0.1x10°8 
4 | ash-2 RNAi ox | 25 | 30.2 | 87 | | | | 6.17±0.021 (n=223) | 
4 | ash-2 RNAi red | 25 | 30.4 | 79 | 0.9133 | - | | 5.56±0.046 (n=223) | 0.5x10°6 
4 | set-2 RNAi ox | 18 | 25.7 | 76 | | | | 6.19±0.025 (n=214) | 
4 | set-2 RNAi red | 18 | 27 | 85 | 0.8589 | 4.8%† | | 5.29±0.041 (n=217) | 0.1x10°? 
5 | control E.V. untr. | 21 | 30 | 78 | | | Ext. Data Fig. 6h | | 
5 | control E.V. PQ | 21 | 31.7 | 64 | 0.5073 | - | | | 
5 | ash-2 RNAi untr. | 23 | 30.4 | 70 | | | | | 
5 | ash-2 RNAi PQ | 19 | 27.6 | 52 | 0.0017 | 17.4%* | | | 
5 | set-2 RNAi untr. | 21 | 31.3 | 59 | | | | | 
5 | set-2 RNAi PQ | 21 | 32.1 | 67 | 0.4515 | - | | | 
6 | control E.V. untr. | 20 | 29.5 | 58 | | | | | 
6 | control E.V. PQ | 18 | 32.3 | 43 | 0.7952 | 10%* | | | 
6 | ash-2 RNAi untr. | 23 | 31.4 | 46 | | | | | 
6 | ash-2 RNAi PQ | 17 | 32.3 | 30 | 0.3873 | 26%* | | | 
6 | set-2 RNAi untr. | 20 | 28.8 | 47 | | | | | 
6 | set-2 RNAi PQ | 23 | 28.7 | 39 | 0.3225 | 13%* | | | 


Worms were treated with the indicated RNAi and sorted into L2ox and L2red at the F₁ generation (experiments 1-4). 

&The corresponding empty-vector controls for experiments 1-4 are shown in Extended Data Table 2 experiments 7-10. Worms were treated with 1 mM paraquat for 10 h at the L2 stage following 
RNAi treatment with ash-2 or set-2 (experiments 5 and 6). P values for Kaplan-Meier survival analysis were calculated on the basis of the log-rank (Mantel-Cox) method. Samples were compared 
with the respective L2ox or untreated control. Maximum lifespan was defined as the lifespan of the last 10% of the population. 

*Percentage change based on mean lifespan. 

†Percentage change based on maximum lifespan. 

‡Grx1-roGFP2 ratio of L2ox and L2red subpopulations, based on BioSorter analysis during sorting. Data are mean ± s.e.m. n, number of worms. Two-sided unpaired t-test. 
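
A two-sided unpaired comparison of two ratio measurements reported as mean ± s.e.m. can be sketched from the summary values alone. This is an illustration under stated assumptions: it uses the unequal-variance (Welch) form of the t statistic and a normal approximation for the P value, which is reasonable only for large n; the authors' software and exact test variant may differ, and the function names are hypothetical.

```python
import math


def welch_t_from_sem(mean_a, sem_a, mean_b, sem_b):
    """t statistic for two independent groups given as mean and s.e.m.

    The standard error of the difference is sqrt(sem_a^2 + sem_b^2).
    """
    return (mean_a - mean_b) / math.sqrt(sem_a ** 2 + sem_b ** 2)


def two_sided_p_normal(t_stat):
    """Two-sided P value using the normal approximation (large samples)."""
    return math.erfc(abs(t_stat) / math.sqrt(2.0))


# Illustrative inputs in the style of the BioSorter ratio column:
# 5.00 ± 0.043 versus 4.66 ± 0.046.
t_stat = welch_t_from_sem(5.00, 0.043, 4.66, 0.046)
p_value = two_sided_p_normal(t_stat)
```

For small groups the normal approximation understates the P value, and a t distribution with the appropriate degrees of freedom should be used instead.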
Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 
n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection BioRad - Image Lab (v5.2.1) was used for collecting gel and blot images 
Union Biometrica - FlowPilot v. 1.5.9.4 was used to collect and analyze Biosorter data 
No software or computer code was used to generate data 


Data analysis Prism - GraphPad Software Inc. (v7.0c) 
Fiji - ImageJ (v1.52a) bundled with Java 1.8.0_112 
Microsoft Office 2010 Excel 
FastQC (version v0.11.3) 
TopHat (version 2.0.13) 
Bowtie2 (version 2.2.1) 
Cufflinks/CuffDiff (version 2.1.1) 
CummeRbund R 
Bioconductor Package GSA 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 


All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


RNA-sequencing data described in this study have been uploaded to the Gene Expression Omnibus (GEO) database with accession number GSE138502 and will be 
made public upon publication. 
Source data files as well as uncropped gel and blot images are provided with the paper. Supplementary Table 1 contains RNAseq data and analysis. 
[x] Life sciences   [ ] Behavioural & social sciences   [ ] Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 
Sample size No sample-size calculations were performed. Sample size was determined to be 
adequate based on the magnitude and consistency of measurable differences 
between groups in preliminary experiments. 
RNA sequencing was performed in 4 biological replicates. The very high correlation between these replicates suggests that this is sufficient. 
In many experiments numerous worms were available and large sample sizes were used. 


Data exclusions In lifespan assays, we excluded plates based on the following pre-established criteria: 1) growth of bacteria that were not originally seeded on 
the plate; 2) fungal growth; 3) plate desiccation. 


Replication All findings reported were reliably reproduced in the lab on independent occasions: 1) different facilities 2) different C. elegans stocks 3) un 


Randomization Samples were allocated into groups (oxidized vs reduced) based on Grx1-roGFP2 readouts using a large-particle flow cytometer, i.e. BioSorter 


Blinding Blinding was performed on lifespan assays 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used anti-H3 (Abcam, # ab1791; Lot # GR325252171-1; 1:2,000 working dilution)
anti-H3K4me3 (Abcam, # ab8580; Lot # GR3275503-1; 1:1,000 working dilution)
anti-H3K27ac (Abcam, # ab4729; Lot # GR3231988-1; 1:1,000 working dilution)
anti-H3K27me3 (Millipore, # 07-449; Lot # 2919706; 1:1,000 working dilution)
anti-ASH-2 (Abmart, # X3-G5EFZ3; Lot 31284-1; 1:1,000 working dilution)
anti-β-tubulin (Santa Cruz, # sc-5274; Lot # B2218; 1:2,000-2,500 working dilution)
anti-ASH2L (Bethyl Laboratories, # A300-489A; Lot # 2; 1:1,000 working dilution)
anti-MLL1 (Bethyl Laboratories, # A300-374A; Lot # 5; 1:500 working dilution)


Validation All the antibodies used in this work are commercially available and have been published/cited. 
https://www.citeab.com/antibodies/763778-ab1791-anti-histone-h3-antibody-nuclear-loading-con 
https://www.citeab.com/antibodies/763751-ab8580-anti-histone-h3-tri-methyl-k4-antibody-chi?des=E833EC8BFEO454AA 
https://www.citeab.com/antibodies/778149-ab4729-anti-histone-h3-acetyl-k27-antibody-chip-g?des=EE13B14650FDA8F3 
https://www.citeab.com/antibodies/221356-07-449-anti-trimethyl-histone-h3-lys27-antibody?des=348CD7BFAOCC732E
https://www.citeab.com/antibodies/835112-sc-5274-tubulin-antibody-d-10?des=517C27105F70CC7F 
https://www.citeab.com/antibodies/655121-a300-489a-ash2-antibody?des=32EFD81E08FF28C8 
https://www.citeab.com/antibodies/654913-a300-374a-mll1-antibody?des=620F371A0BC1D965 


The ASH-2 C. elegans antibody from Abmart (X3-G5EFZ3) was tested against C. elegans treated with ash-2 RNAi and with
purified recombinant C. elegans ASH-2.


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) HeLa EM-II cell line was a gift from JK Nandakumar (University of Michigan) 
Authentication STR 
Mycoplasma contamination HeLa EM-II cell line has been tested in the lab for mycoplasma contamination 


Commonly misidentified lines No commonly misidentified lines were used. 
(See ICLAC register) 
Animals and other organisms


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals All C. elegans strains used in this study are detailed in Methods 
Wild animals The study did not involve wild animals 

Field-collected samples The study did not involve field-collected samples 

Ethics oversight No ethical approval or guidance is required.


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
In the interphase of the cell cycle, chromatin is arranged in a hierarchical structure
within the nucleus, which has an important role in regulating gene expression.
However, the dynamics of 3D chromatin structure during human embryogenesis
remain unknown. Here we report that, unlike mouse sperm, human sperm cells do
not express the chromatin regulator CTCF and their chromatin does not contain
topologically associating domains (TADs). Following human fertilization, TAD
structure is gradually established during embryonic development. In addition, A/B
compartmentalization is lost in human embryos at the 2-cell stage and is
re-established during embryogenesis. Notably, blocking zygotic genome activation
(ZGA) can inhibit TAD establishment in human embryos but not in mouse or
Drosophila. Of note, CTCF is expressed at very low levels before ZGA, and is then
highly expressed at the ZGA stage when TADs are observed. TAD organization is
significantly reduced in CTCF knockdown embryos, suggesting that TAD
establishment during ZGA in human embryos requires CTCF expression. Our results
indicate that CTCF has a key role in the establishment of 3D chromatin structure
during human embryogenesis.


During mouse embryogenesis, TADs are disorganized in 2-cell embryos
and become increasingly established during development. To study
the process of 3D chromatin structure reprogramming in human
embryos, we examined human sperm, 2-cell embryos, 8-cell embryos,
morula, blastocysts and six-week-old embryos. We optimized a previous
low-input Hi-C (genome-wide chromosome conformation capture)
method by using as few as 50-100 cells to construct Hi-C libraries in
human early embryos (Methods). Our new ultra-low-input Hi-C method
can detect very similar chromatin structures in a single mouse blastocyst
to those obtained from pooled mouse blastocysts using the previous
method (Extended Data Fig. 1a–c). At least two biological replicates
were sequenced for each collected developmental stage (Supplementary
Table 1) and showed high reproducibility (Extended Data Fig. 1d–f).


TAD structures in human early embryos 


Next, we examined chromatin interactions in human sperm and 
embryos. We did not detect the characteristic ‘triangle’ interactions of 
TAD structures in human 2-cell embryos. There was a low level of these 
interactions in 8-cell embryos, and the level increased during human 
embryonic development (Fig. 1a). To rule out the possible influence
of read depth on the analysis, we randomly sampled the same number of
reads for each stage when plotting interaction heat maps, and obtained
consistent results (Extended Data Fig. 2a).
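The equal-depth comparison described above can be illustrated with a minimal Python sketch. It assumes that the Hi-C contacts of each stage are held as a list of (bin_i, bin_j) pairs; the function name downsample_contacts, the toy data and the fixed random seed are illustrative assumptions and are not taken from the authors' pipeline.

```python
import random

def downsample_contacts(contacts_by_stage, seed=0):
    """Randomly subsample every stage to the depth of the shallowest stage."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    target = min(len(c) for c in contacts_by_stage.values())
    return {
        stage: rng.sample(contacts, target)  # sampling without replacement
        for stage, contacts in contacts_by_stage.items()
    }

# Toy example: hypothetical contact pairs (bin_i, bin_j) for two stages.
toy = {
    "2-cell": [(i, i + 1) for i in range(50)],
    "8-cell": [(i, i + 2) for i in range(80)],
}
equal_depth = downsample_contacts(toy)
```

After this step every stage contributes the same number of contacts, so differences between heat maps are less likely to reflect sequencing depth alone.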


We further investigated TAD reprogramming during human 
embryogenesis. We used a TAD separation-score method (Methods) 
to call TAD domains and TAD boundaries (Supplementary Table 2). 
The majority of TAD domains can be detected even with very low 
read depth (Extended Data Fig. 2b, c). We also calculated TAD signal 
and directional index. Our data show that TAD signal variance and
directional index are lowest at the 2-cell stage and increase gradually 
through development (Fig. 1b, Extended Data Fig. 2d, e). To exclude 
possible experimental bias in TAD analysis of human embryos, we used 
mouse morula embryos as spiked-in controls, which were mixed in 
with human 2-cell, 8-cell and morula embryos. We then constructed
Hi-C libraries for these mixed samples (Methods). In parallel, we also 
generated Hi-C libraries for mouse morula embryos that were not 
previously mixed with human samples. These results show that TAD 
structures from the mixed samples remain obscure in human 2-cell 
embryos and become clearer in 8-cell embryos and morula embryos
(Extended Data Fig. 2f, g), whereas all the spiked-in mouse morula 
embryos show clear TAD structures (Extended Data Fig. 2h). TAD 
signal analysis from the mixed samples supports our finding that 
TAD structures become established during human embryogenesis 
(Extended Data Fig. 2i). 
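As an illustration of the directional index mentioned above, the following Python sketch uses the widely cited definition of Dixon et al. (2012), in which A and B are the contact counts of a bin with a fixed window of upstream and downstream bins and E = (A + B)/2. The window size, the toy matrix and the function name are assumptions made for this example; the exact parameters used in this study are given in the Methods and may differ.

```python
def directional_index(matrix, window):
    """Directional index per bin for a symmetric contact matrix (list of lists)."""
    n = len(matrix)
    di = []
    for i in range(n):
        # A: contacts with up to `window` bins upstream; B: downstream.
        a = sum(matrix[i][j] for j in range(max(0, i - window), i))
        b = sum(matrix[i][j] for j in range(i + 1, min(n, i + window + 1)))
        e = (a + b) / 2.0  # expected count if there were no directional bias
        if a == b or e == 0:
            di.append(0.0)
            continue
        sign = 1.0 if b > a else -1.0
        di.append(sign * ((a - e) ** 2 / e + (b - e) ** 2 / e))
    return di

# Toy matrix with two 3-bin domains: strong contacts inside each block.
toy = [[10 if (i < 3) == (j < 3) else 1 for j in range(6)] for i in range(6)]
di = directional_index(toy, window=2)
```

In the toy matrix the index switches from negative (bin 2) to positive (bin 3), which is the signature expected at a boundary between two domains. A matrix without domain structure would give values close to zero throughout.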

In sum, these data show that TAD structures are largely absent in 
human 2-cell embryos, are weakly present in 8-cell embryos, and 
become increasingly evident during embryonic development. 


1CAS Key Laboratory of Genome Sciences and Information, Collaborative Innovation Center of Genetics and Development, Beijing Institute of Genomics, Chinese Academy of Sciences,
Beijing, China. 2Center for Reproductive Medicine, Shandong University, The Key laboratory of Reproductive Endocrinology, Ministry of Education, Shandong University, Jinan, China.
3University of Chinese Academy of Sciences, Beijing, China. 4CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China. 5Center for
Reproductive Medicine, Ren Ji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China. 6Shanghai Key Laboratory for Assisted Reproduction and Reproductive Genetics,
Shanghai, China. 7These authors contributed equally: Xuepeng Chen, Yuwen Ke, Keliang Wu, Han Zhao. *e-mail: liuj@big.ac.cn; chenzijiang@hotmail.com
Fig. 1 | Three-dimensional chromatin structures of human sperm and
embryos. a, Snapshot of the interaction heat map for high-order chromatin
structures in human sperm, human embryos and H1 human ES cells (hESCs) at
40-kb resolution (pooled biological replicates; n = 2-3). b, Relative variance of
TAD signal in human embryos calculated with equal numbers of reads
(generated from 2-3 biological replicates for each stage). Each dot represents
one chromosome. P values are also shown; two-sided Wilcoxon rank-sum test.


The dynamics of A/B compartments 


A/B compartmentalization is an important hierarchical structural feature
of chromatin organization. We found that the chromosomes
of human 2-cell embryos lost A/B compartmentalization, but usually 
partitioned into a few large interaction domains (Fig. 1c, Extended Data 
Fig. 3a, b), in contrast to those of mouse zygotes and early embryos 
(Extended Data Fig. 3c). This pattern is similar to the mouse inactive X 
chromosome. A/B compartmentalization is very weak at the human
ZGA stage, and becomes more evident during development (Fig. 1c,
Extended Data Fig. 3a). By contrast, there is clear A/B compartmentalization
at the mouse ZGA stage (Extended Data Fig. 3c–e). The dynamic
process of A/B compartmentalization in human embryos is further 
supported by compartment strength (Fig. 1d). Additionally, principal 
components (PC) analysis of the morula stage onwards showed bimodal 
distribution patterns of PC1 values, which are not observed at the 8-cell 
stage—suggesting poor compartment segregation at the 8-cell stage 
(Extended Data Fig. 3f). Together, these results suggest that A/B com- 
partmentalization of chromosomes is not present at the 2-cell stage, is 
very weak at the 8-cell stage, emerges clearly at the morula stage and 
becomes more evident at the blastocyst stage. 
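The PC1 tracks referred to above are derived from a Pearson correlation matrix of the contact map. The sketch below shows the idea in plain Python under simplifying assumptions: it skips the distance normalization that is usually applied first, uses power iteration instead of a full eigendecomposition, and runs on a toy matrix. All function names are hypothetical.

```python
import math

def pearson_matrix(rows):
    """Pearson correlation between every pair of rows of a contact matrix."""
    def corr(x, y):
        n = len(x)
        mx, my = sum(x) / n, sum(y) / n
        cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
        sx = math.sqrt(sum((a - mx) ** 2 for a in x))
        sy = math.sqrt(sum((b - my) ** 2 for b in y))
        return cov / (sx * sy) if sx and sy else 0.0
    return [[corr(r, s) for s in rows] for r in rows]

def first_eigenvector(mat, iterations=200):
    """Dominant eigenvector of a symmetric matrix by power iteration."""
    n = len(mat)
    vec = [1.0] + [0.0] * (n - 1)  # arbitrary non-zero start vector
    for _ in range(iterations):
        nxt = [sum(mat[i][j] * vec[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(v * v for v in nxt))
        if norm == 0:
            break
        vec = [v / norm for v in nxt]
    return vec

# Toy contact matrix with two blocks of preferentially interacting bins.
toy = [[10 if (i < 3) == (j < 3) else 1 for j in range(6)] for i in range(6)]
pc1 = first_eigenvector(pearson_matrix(toy))
```

In the toy case the sign of PC1 separates the two blocks, which corresponds to the bimodal PC1 distribution described for well-segregated compartments. The sign itself is arbitrary; which group is labelled A or B has to be decided from external information such as gene density.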

Next, we investigated A/B compartment switches during human 
embryogenesis. We found that 13% of genomic regions show A/B com- 
partment switches from the morula stage to the 6-week stage and 77% 
of them change their compartment statuses only once (Extended Data 
Fig. 4a). Then, we examined the potential effect of A/B compartment 
switches on embryonic development. Gene ontology analysis shows 
that genes that undergo A/B compartment switches are significantly 
enriched in developmental processes (Extended Data Fig. 4b, Sup- 
plementary Table 3). For example, genes undergoing a B-to-A switch 
between blastocyst and 6-week stage are enriched in telencephalon 
development and organ morphogenesis (Extended Data Fig. 4b). NEFM
and NEFL, which function in maintaining the neuronal calibre, switch 
their B compartment status to A compartment and become highly 
expressed (Extended Data Fig. 4c). 


Data are mean ± s.e.m. c, PC1 value tracks and Pearson correlation heat maps
for chromosome 5 at 500-kb resolution in human embryos with equal
numbers of reads (generated from 2-3 biological replicates for each stage).
d, Compartment strength in human embryos with equal numbers of reads.
Data are mean ± s.d., obtained by bootstrapping (n = 100). P values were
calculated by one-sided t-test.


Intensive genome-wide epigenetic reprogramming occurs during
human embryonic development. We analysed how A/B compartments
and other epigenetic marks are orchestrated in human embryos.
Consistent with the previous report in mouse embryos, unmethylated
CpGs in human embryos are enriched in A compartments; the
level of CpG demethylation in A compartments is significantly higher
than that in B compartments (Extended Data Fig. 4d, e). Furthermore,
our data show that DNA demethylation levels are higher in regions
that maintain their A compartment status from sperm to blastocyst
(Extended Data Fig. 4f). This suggests that A compartments provide
a more-open local nuclear environment that may enable easier access
for DNA-binding proteins.


Human sperm do not have typical TADs 


Previous studies have shown that TAD structures are present in mature
mouse sperm. Surprisingly, we did not observe typical triangle TAD
structures in human sperm (Fig. 1a). For example, there was no TAD
boundary at the HOXA cluster region in human sperm, but it was present
in human blastocysts (Fig. 2a) and mouse sperm (Extended Data
Fig. 5a). To validate this observation, we plotted TAD signal variance at
different read depths in human sperm and blastocysts. Unlike in mouse
sperm and human blastocysts, the y intercept of the human sperm line
is close to 0 (Fig. 2b), suggesting an absence of TADs in human sperm.
We further compared the density of interaction insert size between 
human sperm and mouse sperm. Our data show that human sperm 
presents one main peak around 4 Mb (middle-range), whereas mouse 
sperm has a shoulder peak around 933 kb and a long-range main peak 
at around 41 Mb (Extended Data Fig. 5b). Additionally, there was also 
a difference between human sperm and mouse sperm in the contact 
probability decay curve (Extended Data Fig. 5c). Together, these results 
suggest that human sperm do not contain TADs. 
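The y-intercept argument above can be made concrete with a small least-squares sketch in Python. One reading of the analysis is that sampling noise contributes variance roughly proportional to 1/(number of reads), so the intercept estimates the variance that would remain at unlimited depth. The numbers and the function name below are invented for illustration and are not data from this study.

```python
def fit_line(x, y):
    """Ordinary least-squares slope and intercept for paired observations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Hypothetical numbers: variance of a TAD signal measured at four read depths.
reads = [1e6, 2e6, 4e6, 8e6]
inv_depth = [1.0 / r for r in reads]            # x axis: 1 / number of reads
variance = [0.05 + 2e5 * x for x in inv_depth]  # made-up values, intercept 0.05
slope, intercept = fit_line(inv_depth, variance)
```

An intercept clearly above zero, as in this made-up series, would point to depth-independent structure, whereas an intercept near zero would be consistent with the absence of such structure.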

To exclude potential experimental bias, we mixed mouse sperm with 
human sperm and constructed a Hi-C library for the sperm mixture 
Fig. 2 | Human sperm does not have typical TAD structures. a, Interaction
heat maps around the HOXA cluster in human sperm and blastocysts at 10-kb
resolution (pooled biological replicates; n = 3). b, Linear regression curves for
TAD signal variance versus read depth (1/no. of reads) in human sperm and
blastocysts. The linear extrapolation of regression lines for human blastocysts


(Methods). In parallel, we mixed human HeLa cells and mouse HT22 
cells. Consistent with our previous result, we observed TADs from 
mouse sperm—but not from human sperm—in the sperm mixture 
(Extended Data Fig. 5d, e). By contrast, TAD structures were observed in 
both human HeLa cells and mouse HT22 cells (Extended Data Fig. 5d, e). 
These data corroborate the absence of TADs from human sperm. 

We also inspected A/B compartmentalization in human sperm and 
found that chromatin from human sperm, both in isolation and when 
mixed with mouse sperm, shows clear A/B compartmentalization 
(Extended Data Fig. 5f). 

The CTCF-cohesin complex has important roles in high-order
chromatin structures. We investigated levels of CTCF and cohesin
protein in human and mouse sperm. RAD21, a subunit of the cohesin
complex, is present in both human and mouse sperm (Extended Data
Fig. 5g); however, CTCF was not detected in human sperm and was
very weakly expressed in cell lines in which CTCF was depleted using
short interfering RNA (siRNA) (Methods), but was detected in mouse
sperm and in control human and mouse cell lines (Fig. 2c, Extended Data
Fig. 5h). Because depletion of CTCF can lead to the disruption of TAD
structures, the lack of CTCF may underlie the loss of TAD structures
in human sperm.


Establishment of TAD boundaries 


TAD structure is obscure in human 2-cell embryos. However, some 
regions in 2-cell embryos show evidence of insulator binding, which 
can separate upstream and downstream interactions (Extended Data 
Fig. 6a). At the later embryonic stages, most of these regions become 
TAD boundaries (Fig. 3a); these regions can therefore be regarded as 
immature TAD boundaries. In this study, we defined both immature 
TAD boundaries and mature TAD boundaries as insulated boundaries. 
We then analysed the dynamics of insulated boundaries during human 
embryonic development. Our data show that 635, 905, 317 and 306 insu- 
lated boundaries developed primarily at the 2-cell, 8-cell, morula and 
blastocyst stages, respectively (Fig. 3a, Supplementary Table 4). We also 
found immature TAD boundaries in mouse 2-cell embryos, and identi- 
fied the stage-gained insulated boundaries in mouse (Extended Data 
Fig. 6b, Supplementary Table 5). We compared the insulated boundaries 
at the 2-cell stage with the total boundaries at the blastocyst stage in 
human samples, showing that 2-cell boundaries overlapped with 30% 
of blastocyst boundaries (Fig. 3b). Our data also show that the ZGA 
stage contained 67% of boundaries present in the blastocyst, which is 
similar to the proportion in mouse models (Fig. 3b). Furthermore, we 
found significant overlap when we compared human boundaries at 
the ZGA stage with those from mouse (Fig. 3c). For example, we found 
insulated boundaries at the ZGA stage in both human and mouse around
the TTC1 and CCNG1 genes (Extended Data Fig. 6c, d).
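The boundary comparisons above amount to asking what fraction of one boundary set has a counterpart in another set. A minimal Python sketch follows; the matching tolerance (one 40-kb bin), the single-chromosome coordinates and the function name are assumptions for illustration, since the matching rule used in the study is described in the Methods rather than here.

```python
def fraction_overlapping(reference, query, tolerance):
    """Fraction of reference boundaries with a query boundary within `tolerance` bp."""
    if not reference:
        return 0.0
    hits = sum(
        1 for ref in reference
        if any(abs(ref - q) <= tolerance for q in query)
    )
    return hits / len(reference)

# Hypothetical boundary midpoints (bp) on one chromosome.
blastocyst = [400_000, 1_200_000, 2_000_000, 3_600_000, 5_000_000]
two_cell = [440_000, 3_560_000, 7_000_000]
shared = fraction_overlapping(blastocyst, two_cell, tolerance=40_000)
```

With these toy coordinates two of the five blastocyst boundaries are matched, giving 0.4. Applied genome-wide and stage by stage, the same calculation would yield cumulative percentages of the kind plotted in Fig. 3b.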

Next, we sought to identify the genomic regions that exhibit a prefer- 
ence for first forming insulated boundaries at an early stage. The distance 
of genomic regions that first gained boundaries at the 2-cell stage to 
housekeeping genes was smaller than that of regions that
gained boundaries at a later stage (Fig. 3d). A similar result was observed in
mouse embryos (Extended Data Fig. 7a). These data indicate that insulated 
boundaries gained at the earlier stage tend to locate around housekeep- 
ing genes in both human and mouse. We also found that the expression 
levels of housekeeping genes near the boundaries tended to be higher 
than those of other housekeeping genes (Extended Data Fig. 7b). 
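The distance comparison above is reported in Fig. 3d as cumulative distributions compared with a two-sided Kolmogorov-Smirnov test. The Python sketch below computes only the test statistic (the largest gap between two empirical cumulative distributions) on invented numbers; a P value would normally be obtained from a statistics library such as scipy.stats.ks_2samp. The function name and data are illustrative assumptions.

```python
def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic: max gap between empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    i = j = 0
    gap = 0.0
    while i < len(a) and j < len(b):
        x = min(a[i], b[j])
        # Advance past every observation equal to x in both samples.
        while i < len(a) and a[i] == x:
            i += 1
        while j < len(b) and b[j] == x:
            j += 1
        gap = max(gap, abs(i / len(a) - j / len(b)))
    return gap

# Hypothetical distances (kb) from boundaries to the nearest housekeeping gene.
early_gained = [20, 50, 80, 120]
late_gained = [80, 120, 300, 600]
d_stat = ks_statistic(early_gained, late_gained)
```

Here half of the early-gained distances fall below every late-gained distance, so the statistic is 0.5. Identical samples would give 0.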

Repeat elements are reported to associate with TAD boundaries in
cell lines. We thus analysed the enrichment of repeat elements around
stage-specifically gained boundaries in embryos. Our data illustrate
that Alu repeats, but not LINE or MIR repeats (Fig. 3e, Extended Data
Fig. 7c–e), are enriched around the insulated boundaries at the
earlier stage in human. Similar results are observed in mouse embryos
(Extended Data Fig. 7f-h). For example, insulated boundaries gained at
the 2-cell stage around the human or mouse RAB5A gene are established
at an Alu-dense region (Extended Data Fig. 7i). Moreover, our data show
that AluS elements are highly enriched around insulated boundaries
gained at the 2-cell stage (Fig. 3e). In addition, AluS repeats around
insulated boundaries gained at the 2-cell stage were highly expressed at the
cleavage stage compared with other stages (Extended Data Fig. 7j, k).
Collectively, these results show a preference for insulated boundaries
to locate around Alu-dense regions in human embryos.


TAD establishment depends on ZGA 


Previous reports show that TAD establishment is independent of ZGA
in mouse and Drosophila embryos. We investigated whether these
characteristics are conserved in human. We treated human zygotes
with α-amanitin to inhibit ZGA (Extended Data Fig. 8a, b) and collected
embryos at the 8-cell stage (Methods, Supplementary Table 6). Surprisingly,
TAD structure was obscure in α-amanitin-treated 8-cell embryos
(Fig. 4a, Extended Data Fig. 8c, d). Relative variance of the TAD signal
in α-amanitin-treated embryos was also significantly lower than that
in untreated 8-cell embryos (Fig. 4b). Therefore, TAD establishment
in human embryos requires ZGA.
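The legend of Fig. 4b names a two-sided Wilcoxon rank-sum test for this comparison. That test is equivalent to the Mann-Whitney U test, and the statistic can be sketched in a few lines of Python. The values and the function name below are invented for illustration; they are not the measured variances, and the sketch does not compute a P value.

```python
def mann_whitney_u(sample_x, sample_y):
    """U statistic for sample_x: pairs where x exceeds y, ties counted as half."""
    u = 0.0
    for x in sample_x:
        for y in sample_y:
            if x > y:
                u += 1.0
            elif x == y:
                u += 0.5
    return u

# Hypothetical relative TAD-signal variances for two groups.
treated = [0.20, 0.30, 0.25]
untreated = [0.80, 0.90, 0.70]
u_treated = mann_whitney_u(treated, untreated)
u_untreated = mann_whitney_u(untreated, treated)
```

The two U values always sum to the number of pairs (here 3 × 3 = 9). A value of 0 for one group means that every observation in that group is lower than every observation in the other.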


CTCF regulates the chromatin landscape 


Next, we aimed to identify the protein involved in TAD establishment 
during ZGA. The cohesin complex and CTCF have important roles 
Fig. 3 | Establishment of insulated boundaries during human
embryogenesis. a, Heat map for insulation score, centred at insulated
boundaries gained at specific stages (±1 Mb range). n, number of stage-specific
gained insulated boundaries; b, insulated boundary centre. b, The cumulative
percentage of insulated boundary number in each stage relative to the
blastocyst boundary number. c, Venn diagram showing overlap between
human ZGA boundaries and mouse ZGA boundaries (χ² test). d, Cumulative


in higher-order chromosome structures. Thus, we investigated
differences in expression of these proteins. Subunits of the cohesin
complex, such as RAD21, are already highly expressed before ZGA in
human embryos (Extended Data Fig. 9a). By contrast, CTCF expression
is very limited before the ZGA stage, and is sharply increased
at the 8-cell stage when TAD structure is first observed in human
embryos (Fig. 4c, Extended Data Fig. 9b). In the α-amanitin-treated
8-cell embryos, CTCF expression is inhibited (Extended Data Fig. 9c).
Consistently, immunostaining images show that CTCF protein is
barely observed in the 2-cell nucleus (Fig. 4d). CTCF is clearly present
in untreated 8-cell nucleus, but it is absent in the α-amanitin-treated


distribution function plot for the distance of stage-specific gained boundaries
to the closest housekeeping genes in human embryos (pooled data from 2-3
biological replicates). The dashed line shows a distance of 200 kb. P=3.24 x 10°
(2-cell versus 8-cell stage); P= 9.95 x 10°§ (2-cell stage versus morula);
P=3.83x 10 (2-cell stage versus blastocyst); two-sided Kolmogorov-Smirnov
test. e, Enrichment of AluS elements at insulated boundaries gained at specific
stages in human embryos.


8-cell nucleus (Fig. 4d). These results suggest that CTCF expression 
requires human ZGA. 

Next, we investigated whether TAD establishment in human embryos
requires CTCF. We inhibited CTCF expression by injecting
CTCF siRNA (siCTCF) into human zygotes, and collected embryos
at the morula stage (Fig. 4d). Notably, triangle TAD structures were
scarcely observed in the siCTCF morula (Fig. 4e, Extended Data Fig. 9d).
Relative TAD signal variance supports inhibition of TAD establishment
in the siCTCF morula (Extended Data Fig. 9e). Consistently, most of
the TAD boundaries present in the control morula vanish or become weaker
in the siCTCF morula (Extended Data Fig. 9f). Thus, our data suggest
Fig. 4 | CTCF regulates the establishment of chromatin landscape in human
embryos. a, Interaction heat maps in human 8-cell and α-amanitin-treated
8-cell embryos. b, Box plot for relative variance of TAD signal in human 2-cell
(n = 3), 8-cell (n = 3) and α-amanitin-treated 8-cell embryos (n = 3). Boxes show
25th, 50th and 75th percentiles and whiskers show 1.5x the interquartile range.
***Adjusted P < 0.001 for all pairwise comparisons between 8-cell and
α-amanitin-treated 8-cell (two-sided Wilcoxon rank-sum test with
Benjamini-Hochberg multiple testing correction). c, Dynamics of CTCF expression during
human embryonic development (expression data from a previously published dataset; 3-20 cells for
each stage). Data are mean ± s.e.m. RPKM, reads per kilobase of transcript per
million mapped reads. d, Immunofluorescence of CTCF in human embryos
(n = 2-3). Scale bar, 40 µm. e, Track snapshot for TAD structures in untreated
control morula and siCTCF morula, overlaid with gene expression.
that CTCF expression during ZGA is required for TAD establishment
in human embryos.

Frequently interacting regions (FIREs) are a recently defined level of
high-order chromatin structure. We identified FIREs at 40-kb resolution
in human embryos. Human embryo FIREs are enriched in A compartments
(Extended Data Fig. 10a–c). Our data show that both assay for
transposase-accessible chromatin (ATAC) peaks and DNase I hypersensitive
sites (DHSs) are enriched in FIREs (Extended Data Fig. 10d, e).
To check whether CTCF knockdown has an effect on FIRE scores in
human embryos, we analysed the FIRE score in the siCTCF morula and
found that FIRE scores were significantly decreased (Extended Data
Fig. 10f), consistent with previous reports in cell lines.

Additionally, we analysed the gene expression profile of the siCTCF 
morula and identified 565 downregulated genes in the siCTCF morula
(Extended Data Fig. 10g, h). Our data show that highly downregulated
genes in the siCTCF morula usually have CTCF-binding peaks around
transcription start sites (TSSs) in human embryonic stem (ES) cells
(Extended Data Fig. 10i). 

Finally, we wanted to know whether CTCF expression is the sole determinant
of TAD establishment. To address this question, we recovered
CTCF expression by injecting CTCF mRNA into α-amanitin-treated human
zygotes and collected the treated embryos at the 8-cell stage (Extended
Data Fig. 11a, Methods). We did not observe clear triangle TAD patterns in
8-cell embryos that had been treated with α-amanitin and CTCF mRNA,
in contrast to the patterns observed in untreated human 8-cell embryos
and spiked-in mouse morula embryos (Extended Data Fig. 11b-e). Further
insulation boundary analysis and TAD strength analysis validated that
TADs were not established in embryos treated with α-amanitin and CTCF
mRNA (Extended Data Fig. 11f, g). In summary, our data suggest that
CTCF expression is required, but is not the only factor needed, for the
establishment of chromatin structure during human ZGA.

Although both human and mouse embryos show genome-wide repro- 
gramming of high-order chromatin structure, there are substantial dif- 
ferences in chromatin structure between human and mouse embryos. 
Our data provide a valuable resource and mechanistic insights into the 
establishment of chromatin structure during mammalian embryonic 
development. 
Methods


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 


Human subjects 

The regulatory framework for the use of human gametes and
embryos in this research was based on the policies of the Human
Biomedical Research Ethics Guidelines (set by the National Health Commission
of the People’s Republic of China on 1 December 2016), the 2016
Guidelines for Stem Cell Research and Clinical Translation (issued by
the International Society for Stem Cell Research, ISSCR) and the Human
Embryonic Stem Cell Research Ethics Guidelines (set by the China National
Center for Biotechnology Development on 24 December 2003). These
policies and guidelines permit human gametes and/or human embryos
created or genetically manipulated in vitro, and cultured for no more than
14 days, to be used specifically for scientific research. All the experiments in this
study are in compliance with these relevant ethical regulations.

The aim and protocols of this study have been reviewed and approved
by the Institutional Review Board of Reproductive Medicine, Shandong
University and Beijing Institute of Genomics. The protocols include
the injection of siRNA into human embryos. Oocytes donated by
patients undergoing in vitro fertilization treatment were fertilized using
donated sperm by intracytoplasmic sperm injection (ICSI). Written
informed consent was obtained from all oocyte and sperm donors.
The donor women were 25-38 years old with tubal-factor infertility and
their partners had healthy semen. High-quality embryos were
randomly assigned to experimental groups without examination of
the gender of embryos owing to the limited cell number. All embryos
were collected under standard clinical protocols. The number of embryos
at each developmental stage used for Hi-C in this study is shown
in Supplementary Table 1. All control and siRNA-injected embryos were
cultured for no more than 7 days and used only for molecular research
analyses.


Human embryos collection 

We collected human 2-cell, 8-cell, morula and morphological grade AA
blastocysts at the appropriate times after in vitro fertilization (IVF). The
six-week embryos (6-week) were obtained from IVF couples by trans- 
vaginal multifetal pregnancy reduction surgery at the six-week stage 
of pregnancy. 

The 2-cell embryos were derived from immature metaphase I (MI)
oocytes that were not used clinically. MI oocytes matured in vitro
and were fertilized by donated sperm. These fertilized embryos were
then cultured in G1.5 medium (Vitrolife) in a humidified atmosphere at
37 °C with 6% CO2 in air. The 2-cell embryos were collected and vitrified
around 27 h after routine fertilization. The embryo vitrification was carried
out as previously described. In brief, the embryos were incubated
in vitrification solution 1 consisting of 8% ethylene glycol and 8% dimethyl
sulfoxide in Cryobase (10 mM HEPES-buffered medium containing
20 mg ml−1 human serum albumin and 0.01 mg ml−1 gentamicin) at room
temperature for 11 min. After initial shrinkage, embryos that had regained
their original volume were transferred into vitrification solution 2 (16% ethylene
glycol, 16% dimethyl sulfoxide and 0.68 M trehalose in Cryobase) for
1-1.5 min. Embryos were finally transferred onto a Cryotop strip in an
extremely small volume of solution (<0.1 ml) and plunged into liquid
nitrogen. After addition of the protective cover, the Cryotop was transferred
into liquid nitrogen for storage.

The embryos of 8-cell, morula and blastocyst stages were from
donated frozen embryos with signed informed consent. These embryos
were vitrified on day 3, day 4 and day 5 after routine in vitro fertilization,
respectively. The vitrified embryos on Cryotop strips were thawed
rapidly by taking them from the liquid nitrogen after removal of the
protective cover, then immersed in 2.5 ml of 37 °C warming solution 1
(1 M trehalose in Cryobase) for 1 min on a heated stage. Embryos were
then transferred to 0.5 ml of warming solution 2 (0.5 M trehalose in
Cryobase) for 3 min, and then placed into 0.5 ml Cryobase for 5 min
followed by fresh 0.5 ml Cryobase for 1 min. Embryos were finally transferred
to G1.5 or G2 medium (Vitrolife) to evaluate embryo quality.

The thawed embryos with high quality were picked randomly for 
experimental groups. The zona pellucida was removed by mechanical 
dissection with a glass needle. Embryos were washed several times 
by gentle pipetting with a narrow-bore glass pipette to remove the 
attached cumulus or polar bodies. After final washing with 0.1% BSA/ 
PBS three times, the embryos were transferred into microcentrifuge 
tubes (Sorenson BioScience, 39640T). Experiments were performed 
on at least two biological replicates for each embryonic stage. 


Ultra-low-input Hi-C 

Hi-C library generation from ultra-low numbers of cells was opti-
mized according to previous protocols®”°. Samples were fixed with 20 µl
of freshly made 1% formaldehyde solution and incubated at room tem-
perature for 10 min. To quench the reaction, 2.5 M glycine solution
was added to a final concentration of 0.2 M. Samples were incubated
at room temperature for 5 min and then were lysed directly in 100 µl
of ice-cold Hi-C lysis buffer (10 mM Tris-HCl pH 8.0, 10 mM NaCl, 0.2%
Igepal CA630) with a protease inhibitor cocktail on ice for 15 min. Sam-
ples were then centrifuged at 3,000g for 5 min and the supernatants
carefully discarded. Pelleted nuclei were washed once with 100 µl of 1x
NEBuffer 2; the supernatant was discarded to leave 4.5 µl, and 0.5 µl of
5% sodium dodecyl sulfate (SDS) was added. Tubes were gently tapped to mix the
pellet and were incubated at 62 °C for 5 min. After incubating, 14.5 µl
of water and 2.5 µl of 10% Triton X-100 were added to quench the SDS.
Tubes were gently tapped to mix well, avoiding excessive foaming and
then incubated at 37 °C for 15 min. Two and a half microlitres of 10x
NEBuffer 2 and 10 U of MboI restriction enzyme (NEB, R0147) were
added and chromatin was digested at 37 °C for 5 h. Samples were incu-
bated at 62 °C for 20 min to inactivate MboI and then cooled to room
temperature. To fill in the restriction fragment overhangs and mark
the DNA ends with biotin, 5 µl of fill-in master mix (3.75 µl of 0.4 mM
biotin-14-dATP, 0.45 µl of 10 mM dCTP/dGTP/dTTP mix, 0.8 µl of 5 U µl⁻¹
DNA polymerase I, large) was added. Samples were mixed by pipetting
and incubated at 37 °C for 30 min. Ligation master mix (66.3 µl of water,
12 µl of 10x NEB T4 DNA ligase buffer, 10 µl of 10% Triton X-100, 1.2 µl
of 10 mg ml⁻¹ bovine serum albumin, 1 µl of 400 U µl⁻¹ T4 DNA ligase)
was added and samples were incubated at 16 °C for 18 h. Nuclei were
pelleted by centrifugation for 5 min at 3,000g and were washed with
100 µl of 10 mM Tris buffer, pH 8.0. Pellets were then resuspended
in 50 µl of 10 mM Tris buffer with 2 µl of 20 mg ml⁻¹ proteinase K and
incubated at 65 °C for 18 h. Proteinase K was inactivated by incubation
at 75 °C for 30 min. To fragment the DNA ligation products, 50 ng of
λ DNA was added, and the DNA samples were digested with 1 U AluI at
37 °C for 4 h and inactivated at 65 °C for 30 min. Then the fragmented
DNA was treated with the End Repair/dA-Tailing Module (NEB, E7442L)
and Ligation Module (NEB, E7445L) following the operation manual.
Samples were prepared for biotin pull-down by washing with 50 µl of
10 mg ml⁻¹ Dynabeads MyOne Streptavidin T1 beads (Life Technologies,
65602) in 100 µl of 1x Tween washing buffer (1x TWB: 5 mM Tris-HCl, pH
7.5; 0.5 mM EDTA; 1 M NaCl; 0.05% Tween 20). Samples were separated
on a magnet and the solution was discarded. Ligation products were
mixed with 83.5 µl of 2x binding buffer (2x BB: 10 mM Tris-HCl, pH 7.5;
1 mM EDTA; 2 M NaCl) and centrifuged at 10,000g for 1 min. Beads were
resuspended with supernatant and incubated at room temperature for 
30 min with rotation to bind the biotinylated DNA to the streptavidin 
beads. Samples were again separated on a magnet and the solution was 
discarded. Beads were washed by adding 200 µl of 1x TWB and tubes
were heated on a Thermomixer at 55 °C for 2 min with mixing. Beads
were pelleted again using a magnet and the supernatant was discarded.
This wash step was repeated twice. Beads were then resuspended in
100 µl of 10 mM Tris buffer and transferred to a new tube. Beads were
repelleted and the buffer discarded. Beads were resuspended in 20 µl
of 10 mM Tris buffer. The Hi-C library was amplified for 12 cycles of PCR
with Q5 master mix (NEB, M0492L) following the operation manual.
PCR products were confirmed by analysing 1 µl of product using the
FlashGel System (Lonza, 57063). PCR was continued with additional
cycles until bright DNA bands were seen. A bottle of Agencourt AMPure
XP beads (Beckman Coulter, A63881) was warmed to room temperature
and gently shaken to resuspend the magnetic beads. One hundred
microlitres of beads was added to 200 µl of diluted PCR product
(0.5 volumes). Samples were mixed by pipetting and incubated at room
temperature for 10 min. Beads were pelleted on a magnet and the clear
solution was transferred to a new tube. Another 30 µl of beads was
added to the clear solution (0.65x volume), mixed by pipetting and
incubated at room temperature for 10 min. Keeping the beads on the
magnet, samples were washed twice with 200 µl of 70% ethanol (freshly
made) without mixing. Ethanol was then completely removed. Beads
were left on the magnet for 5 min to allow the remaining ethanol to
evaporate. DNA was eluted by adding 20 µl of ddH2O, mixing by pipet-
ting and incubating at room temperature for 5 min. After separating
on a magnet, the solution was transferred to a new tube. DNA was then
quantified and sequenced using an Illumina sequencing platform.
For human 2-cell, 8-cell, morula, blastocyst and 6-week embryos, we 
performed ultra-low-input Hi-C experiments. At least two replicates were
prepared for each stage. For spike-in experiments, we performed ultra-
low-input Hi-C experiments in human 2-cell mixed sample, human 8-cell
mixed sample, human morula mixed sample, human α-amanitin-treated
8-cell mixed sample and human 8-cell (α-amanitin plus CTCF mRNA)
mixed samples. In these spike-in experiments, the indicated stages of
human embryos were each mixed with six mouse morula embryos.
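
As a worked check of the two-step AMPure XP size selection described in this section, the bead volumes follow directly from the bead-to-sample ratios. The sketch below is illustrative only: the function name `double_sided_spri` is hypothetical, and both ratios are assumed to be expressed relative to the original 200 µl of diluted PCR product.

```python
def double_sided_spri(sample_ul, first_ratio=0.5, final_ratio=0.65):
    """Return the bead volumes (µl) for the first and second bead additions.

    Both ratios are bead volume divided by the ORIGINAL sample volume, so the
    second addition only tops the beads up from first_ratio to final_ratio.
    """
    if not 0 < first_ratio < final_ratio:
        raise ValueError("expected 0 < first_ratio < final_ratio")
    first_add = sample_ul * first_ratio
    second_add = sample_ul * final_ratio - first_add
    return first_add, second_add


# 200 µl of diluted PCR product, as in the protocol text
first, second = double_sided_spri(200.0)
print(f"first addition: {first:.0f} µl, second addition: {second:.0f} µl")
```

With the default ratios this reproduces the 100 µl and 30 µl additions stated in the protocol.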


Mouse sperm and morula embryos collection 

Mouse sperm cells were collected from the cauda epididymis of 8-week-
old C57BL/6J male mice (Beijing Vital River Laboratory Animal Tech-
nology, stock number 213). The cauda epididymis was isolated and
washed with 1x PBS three times, cut into pieces and placed in
5 ml 1x EmbryoMax Human Tubal Fluid (HTF) (EMD Millipore,
MR-070-D) with 10% FBS (Gibco, 16140071) in a 6-well plate (Corning).
Two cauda epididymides were placed in each well. Sperm were incubated in HTF for
30 min in a CO2 incubator at 37 °C to reduce somatic cell contamination.
Then, the supernatant with actively swimming sperm was transferred
to a new well without disturbing the settled somatic cells and low-
activity sperm on the bottom. The incubation-transfer cycle was repeated
4-5 times. The quality of samples was assessed by microscopy
before snap freezing. The purity of sperm was over 99%. The sperm
were then collected in a minimal volume and frozen.

The morula embryos were from a cross of C57BL/6J females and
PWK/PhJ males. C57BL/6J female mice aged 4 to 6 weeks were intraperi-
toneally injected with pregnant mare serum gonadotropin (PMSG, 5 IU)
followed by human chorionic gonadotropin (hCG, 5 IU) 48 h later. Morula
embryos were flushed with M2 medium (Sigma) from the reproduc-
tive tract and the uteri at about 70-82 h after hCG administration. The
high quality of the morula embryos was confirmed by microscopy. The
embryos were serially washed to remove contaminants and were treated
with acid Tyrode’s solution (Sigma) for a few minutes to remove zona
pellucida by pipetting through a fine pulled-glass needle. The embryos
were then manually picked into PCR thin-wall tubes and stored at -80 °C.

All animal maintenance and experimental procedures were carried
out according to the guidelines of the Institutional Animal Care and Use Com-
mittee (IACUC) of Beijing Institute of Genomics, CAS.


Sperm in situ Hi-C library generation 

The generation of Hi-C libraries with sperm was optimized according 
to previous protocols’. Three million human sperm cells were used 
for each sample. For spike-in experiments, 3 million human sperm and
1-3 million mouse sperm were mixed together and used for one sample.
As a control experiment, 3 million human HeLa cells (ATCC CCL-2; tested
for mycoplasma contamination but not authenticated) and 3 million
mouse HT22 cells (EMD Millipore, SCC129; tested for mycoplasma
contamination but not authenticated) were mixed and used for one
sample, and Hi-C library generation for the mixed somatic cell sample
followed previously published methods’.

Sperm cells were fixed with 1 ml of freshly made 1% formaldehyde
solution and incubated at room temperature for 10 min. To quench the
reaction, 2.5 M glycine solution was added to a final concentration of
0.2 M. Samples were incubated at room temperature for 5 min and then
centrifuged for 5 min at 3,000g at 4 °C. Supernatant was discarded. The
pellet was washed with ice-cold 1x PBS and spun for 5 min at 3,000g at
4 °C. Supernatant was discarded and cell pellets were flash-frozen in
liquid nitrogen and stored at -80 °C until used, or used immediately.
The sperm pellet was resuspended in 1 ml of ice-cold Hi-C lysis buffer
(10 mM Tris-HCl pH 8.0, 10 mM NaCl, 0.2% Igepal CA630, 0.05% L-α-
lysophosphatidylcholine) with a protease inhibitor cocktail. The cells
were homogenized 5 times with a Dounce on ice and incubated for 30 min.
Samples were then centrifuged at 3,000g for 5 min and the superna-
tants carefully discarded. Pelleted nuclei were washed once with 1 ml
of ice-cold Hi-C lysis buffer. Supernatant was discarded and pellet
was resuspended with 50 µl of 0.5% SDS and incubated at 62 °C for
10 min. After incubating, 145 µl of water and 25 µl of 10% Triton X-100
were added to quench the SDS. Tubes were gently tapped to mix well,
avoiding excessive foaming and then incubated at 37 °C for 15 min.
Twenty-five microlitres of 10x NEBuffer 2 and 100 U of MboI restric-
tion enzyme (NEB, R0147) were added and chromatin was digested at
37 °C overnight with rotation. Samples were incubated at 62 °C for
20 min to inactivate MboI and then cooled to room temperature. To
fill in the restriction fragment overhangs and mark the DNA ends with
biotin, 50 µl of fill-in master mix (37.5 µl of 0.4 mM biotin-14-dATP, 4.5 µl
of 10 mM dCTP/dGTP/dTTP mix, 8 µl of 5 U µl⁻¹ DNA polymerase I) was
added. Samples were mixed by pipetting and incubated at 37 °C for
1.5 h. Ligation master mix (663 µl of water, 120 µl of 10x NEB T4 DNA
ligase buffer, 100 µl of 10% Triton X-100, 12 µl of 10 mg ml⁻¹ bovine serum
albumin, 10 µl of 400 U µl⁻¹ T4 DNA ligase) was added and samples were
incubated at 16 °C for over 10 h with rotation. Nuclei were pelleted by
centrifugation for 5 min at 3,000g and were washed with 1x PBS. Pel-
lets were then resuspended in 400 µl 1x PBS, and 16.7 µl of 20 mg ml⁻¹
proteinase K and 40 µl 10% SDS were added, followed by incubation at 55 °C for 30
min. Then 43.3 µl 5 M NaCl, 12 µl 0.5 M EDTA, 24 µl
1 M DTT and 50 µl 10% SDS were added and the sample was incubated at 68 °C overnight. The sample was
cooled to room temperature, and 1,250 µl pure ethanol and 50 µl 3 M
sodium acetate, pH 5.2 were added, mixed by inverting and incubated at
-80 °C for over 1 h. The sample was centrifuged at 13,000g, 4 °C for 20 min.
Tubes were kept on ice after spinning and the supernatant was care-
fully removed by pipetting. The pellet was washed with 800 µl of 70%
ethanol twice and centrifuged at 13,000g for 5 min, dissolved in 100 µl
of 1x Tris buffer (10 mM Tris-HCl, pH 8) and incubated at 37 °C for
15 min to fully dissolve the DNA. DNA was quantified by Qubit dsDNA
High Sensitivity Assay (Life Technologies, Q32854). Biotin-14-dCTP at
non-ligated DNA ends was removed with the exonuclease activity of T4
DNA polymerase (for 5 µg DNA: 1 µl 10 mg ml⁻¹ BSA, 10 µl 10x NEBuffer
2, 1 µl 10 mM dNTP, and 5 U T4 DNA polymerase (NEB) in a total volume
of 100 µl). The reaction was incubated at 12 °C for 2 h and stopped by
adding 2 µl 0.5 M EDTA pH 8.0. All reaction products were combined.
To make the biotinylated DNA suitable for high-throughput sequenc-
ing using Illumina sequencers, it was sheared to a size of 300-500 bp.
Sheared DNA was transferred to a fresh 1.5-ml tube, and an equal vol-
ume of 2x binding buffer (2x BB: 10 mM Tris-HCl (pH 7.5); 1 mM EDTA;
2 M NaCl) was added, then centrifuged at 13,000g for 5 min. The sample
was prepared for biotin pull-down by washing 50 µl of 10 mg ml⁻¹ Dyna-
beads MyOne Streptavidin T1 beads (Life Technologies, 65602) with
400 µl of 1x Tween washing buffer (1x TWB: 5 mM Tris-HCl (pH 7.5);
0.5 mM EDTA; 1 M NaCl; 0.05% Tween 20), separated on a magnet and
the solution was discarded. Beads were resuspended with sheared DNA
supernatant, incubated at room temperature for 30 min with rota-
tion to bind biotinylated DNA to the streptavidin beads, separated on
a magnet and the solution was discarded. Beads were washed by adding
500 µl of 1x TWB. The tubes were heated on a Thermomixer at 55 °C
for 2 min with mixing. The beads were reclaimed using a
magnet and the supernatant discarded. The wash was repeated
twice. Beads were resuspended in 100 µl 1x Tris buffer (10 mM Tris-
HCl, pH 8) and transferred to a new PCR tube. Beads were reclaimed
and buffer discarded. Beads were resuspended in 50 µl 1x Tris buffer
(10 mM Tris-HCl, pH 8). Then, the beads bound to fragmented
DNA were treated with the End Repair/dA-Tailing Module (NEB, E7442L)
and Ligation Module (NEB, E7445L) following the operation manual.
After ligation, the beads were washed with 1x TWB twice and 1x Tris
buffer (10 mM Tris-HCl, pH 8). Beads were resuspended in 20 µl of
1x Tris buffer (10 mM Tris-HCl, pH 8). The sperm Hi-C library was ampli-
fied for 10 cycles of PCR with Q5 master mix (NEB, M0492L) following
the operation manual. PCR products were confirmed by analysing
1 µl of product using the FlashGel System (Lonza, 57063). PCR was
continued with additional cycles until bright DNA bands were seen.
A bottle of Agencourt AMPure XP beads (Beckman Coulter, A63881)
was warmed to room temperature and gently shaken to resuspend
the magnetic beads. One hundred microlitres of beads was added to
200 µl of diluted PCR product (0.5 volumes). Samples were mixed by
pipetting and incubated at room temperature for 10 min. Beads were
pelleted on a magnet and the clear solution was transferred to a new tube.
Another 30 µl of beads was added to the clear solution (0.65x volume),
mixed by pipetting, and incubated at room temperature for 10 min.
Keeping the beads on the magnet, samples were washed twice with
200 µl of 70% ethanol (freshly made) without mixing. Ethanol was then
completely removed. Beads were left on the magnet for 5 min to allow
the remaining ethanol to evaporate. DNA was eluted by adding 20 µl of
ddH2O, mixing by pipetting and incubating at room temperature for
5 min. After separating on a magnet, the solution was transferred to a
new tube. DNA was then quantified and sequenced using an Illumina
sequencing platform.


CTCF western blot 

To confirm the CTCF or RAD21 antibody, 0.5 million HEK 293 cells 
(obtained from ATCC: ATCC CRL-1573; tested for mycoplasma con- 
tamination but not authenticated) and 0.5 million 4T1 cells (obtained 
from ATCC: ATCC CRL-2539; tested for mycoplasma contamination 
but not authenticated) were resuspended separately in 50 µl 4% SDS
and boiled for 5 min. To exclude the possibility that the CTCF antibody
gave false-positive detection of a paralogous protein, we trans-
fected HEK 293 cells and HeLa cells with human CTCF siRNA no. 1 (GGAG
CCUGCCGUAGAAAUUTT)”S and human CTCF siRNA no. 2 (CAGAG
AAAGUGGUUGGUAA)” or control oligonucleotides for two rounds
(denoted as siCTCF cell lines and control cell lines). The HEK 293 cells
and HeLa cells were also resuspended separately in 50 µl 4% SDS and boiled
for 5 min. Protein concentration was assessed by BCA Protein Assay
Kit (cat no: CW0014, CWBIO). SDS-PAGE loading buffer (5x) (cat no:
CW0027, CWBIO) was added to each sample before loading onto the SDS-PAGE
gel. The same total protein amount, 20 µg, was loaded.

For mouse and human sperm samples, 10 million sperm cells were
resuspended separately in 50 µl 4% SDS, and sonicated using a Diagenode
Bioruptor sonication device for 10 cycles of 30 s on and 30 s off, then
boiled for 30 min. Protein concentration was assessed by BCA Protein
Assay Kit (cat no: CW0014, CWBIO). SDS-PAGE loading buffer (5x)
(cat no: CW0027, CWBIO) was added to each sample before loading onto the
SDS-PAGE gel. The same total protein amount, 30 µg, was loaded.

Samples were run on 4-12% SDS-PAGE and transferred to 0.2-µm
nitrocellulose membrane (Bio-Rad). Transfer was carried out in 25 mM
Tris base, 192 mM glycine, 20% methanol solution. The membranes
were stained with Ponceau red and photographed using a camera,
then were blocked with 5% (w/v) milk buffer (PBS/0.1% Tween 20) for 1 h at
room temperature. The primary rabbit monoclonal antibody used was:
CTCF (cat no: ab188408, Abcam) or RAD21 (cat no: ab992, Abcam) at a
dilution of 1:1,000, at 4 °C overnight. Membranes were washed three
times with TBST. The secondary antibody used was: goat anti-rabbit IgG,
HRP conjugated (cat no: CW0103, CWBIO) at a dilution of 1:1,000, for 1 h
at room temperature. Membranes were washed three times with TBST.
Signal development followed the kit manufacturer’s recommendations
(cat no: CW0049, CWBIO). Excess reagent was removed and the membrane
was covered in transparent plastic wrap. Images were developed in a dark
room for chemiluminescence or with digital imaging equipment.


ZGA block with α-amanitin treatment in human embryos

Zygotes were collected after IVF or ICSI. Zygotes were transferred to G1
media in the presence or absence of α-amanitin (25 ng µl⁻¹, Sigma-Aldrich).
After the control and α-amanitin-treated embryos reached the 8-cell stage,
embryos of high quality were collected. Others with fragments or those
that were arrested at other stages were discarded. The zona pellucida
was gently removed by mechanical dissection with a glass needle. The
cells were washed three times with 1x PBS to avoid potential contamina-
tion. They were then prepared for Hi-C, RNA-seq and immunostaining.


CTCF knockdown by siRNA injection and CTCF overexpression 
by mRNA injection in human embryos 
To investigate the function of CTCF in human embryonic TAD establish-
ment, two independent siRNAs targeting human CTCF were used for
microinjection: human CTCF siRNA no. 1 (GGAGCCUGCCGUAGAAAUU
TT)?8; human CTCF siRNA no. 2 (CAGAGAAAGUGGUUGGUAA)” and
negative control siRNA (UUCUCCGAACGUGUCACGUdTdT). For this
assay, the donated IVM oocytes were used. They were in vitro-ferti-
lized and cultured in G1.5 medium as described above. siRNA solution
(10 mM) was loaded into injection pipette and injected into the zygote
before pronuclear fading using Eppendorf PiezoXpert and Eppendorf
CellTram vario microinjector. The injected embryos were cultured in G1.5
medium (Vitrolife) in a humidified atmosphere at 37 °C with 6% CO2 in air.
The injected embryos with normal morphology were collected at morula
stage after fertilization. For RNA-seq library preparation, only one morula
embryo was used for each assay. Morula embryos were collected for
RNA-seq, Hi-C library generation and immunostaining, respectively. We
collected three replicates in total for siCTCF morula: one replicate with
human CTCF siRNA no. 1 (siCTCF morula no. 1) and two replicates with
human CTCF siRNA no. 2 (siCTCF morula no. 2 and siCTCF morula no. 3).
To investigate whether human embryonic TAD establishment dur-
ing ZGA is solely due to CTCF expression, we overexpressed CTCF in
α-amanitin-treated embryos. The cDNA of complete open reading
frame of human CTCF gene was cloned into the pcDNA3.1 vector. These
plasmids were first linearized with NotI and mRNA was synthesized
with SP6 polymerase using the mMessage mMachine kit (Ambion). The
mRNA (1.5 µg µl⁻¹) was injected into the zygotes. Zygotes were transferred
to G1 medium in the presence of α-amanitin (25 ng µl⁻¹, Sigma-Aldrich).
After these embryos reached the 8-cell stage, embryos with high qual-
ity were collected. Others with fragments or arrested at other stages
were discarded. The zona pellucida was gently removed by mechanical
dissection with a glass needle. The cells were washed three times with
1x PBS to avoid potential contamination. They were then prepared for
Hi-C library generation and immunostaining.


SMART-seq2 library preparation 

The human embryos were lysed directly and used for cDNA synthesis
using SMART-Seq v.4 Ultra Low Input RNA Kit for Sequencing (Takara,
634888). In brief, the sample volume was adjusted to 9.5 µl with nucle-
ase-free water. After adding 1 µl 10x reaction buffer (0.95 µl 10x lysis
buffer, 0.05 µl RNase inhibitor), samples were incubated at room tem-
perature for 5 min, then placed on ice. Next, 2 µl of 3’ SMART-Seq CDS
Primer II A (12 µM) was added. Following incubation at 72 °C for 3 min,
samples were placed on ice for 2 min. cDNA synthesis reaction was set
up by adding 4 µl 5x ultra low first-strand buffer, 1 µl SMART-Seq v.4
Oligonucleotide (48 µM), 0.5 µl RNase Inhibitor (40 U µl⁻¹) and 2 µl
SMARTScribe reverse transcriptase. The reaction was performed in
a thermal cycler with the following program: 42 °C for 90 min, 70 °C for
10 min, 4 °C hold. The first-strand cDNA product was amplified by
adding 25 µl 2x SeqAmp PCR Buffer, 1 µl PCR Primer II A (12 µM), 1 µl
SeqAmp DNA Polymerase and 3 µl nuclease-free water. Sixteen rounds
of PCR amplification were used with the following program: 95 °C for
1 min; 98 °C for 10 s, 65 °C for 30 s and 68 °C for 3 min, repeat these
3 steps 15 times; 72 °C for 10 min; 4 °C hold. The amplified cDNA was puri-
fied using 1 volume SPRIselect beads, then fragmented to 200-400 bp
using a Covaris sonicator (Covaris). Sequencing libraries were prepared
with NEBNext Ultra II DNA Library Prep Kit for Illumina (NEB, E7645S) as
described above. To obtain an adequate amount of DNA for sequenc-
ing, the cycle of PCR amplification was determined according to the
amount of 1 µl amplified DNA, which was evaluated using FlashGel
System (Lonza, 57063). The libraries were sequenced on HiSeq X10
with paired-end 150 bp (Illumina).


Immunostaining 

After removal of the zona pellucida, human embryos were washed three
times in 1x PBS, and fixed in PFA solution (4% paraformaldehyde, 0.04%
Triton, 0.3% Tween in 1x PBS) for 20 min at room temperature. After washing three times
in 1x PBS for 5 min, the embryos were permeabilized in 1x PBS containing
0.5% Triton for 20 min. Embryos were incubated with blocking buffer
(3% BSA, 0.01% Tween-20 in 1x PBS) for 2 h at room temperature, then
incubated with CTCF antibodies (cat. no.: ab188408, Abcam, 1:500
dilution) for about 12 h at 4 °C. After washing 6 times in 0.1% Triton
in 1x PBS (PBST), embryos were blocked in blocking buffer again for
30 min and then incubated for 1 h at room temperature with Alexa Fluor
488-labelled goat anti-rabbit IgG (Beyotime, P0176). Embryos were
washed in PBST 3 times, then stained with DAPI (Beyotime, C1002) for
10 min. After washing in PBST 3 times, embryos were mounted in
ProLong Gold antifade reagent (Thermo Fisher, P10144). Confocal images
were obtained with a Zeiss LSM 710 confocal microscope using a 63x oil
objective. All the staining assays were repeated independently at least
twice. Images were processed and quantified with ZEN software (2010)
and ImageJ software (v.1.52a).


Hi-C data processing 
The HiCExplorer suite*** (v.2.1) was used for the processing of valid 
mapped Hi-C reads. Ultra-low-input Hi-C data were first mapped to 
human genome hg19 using bwa mem with parameters ‘-E50 -L0’. Then,
we filtered out read pairs which were not uniquely mapped (mapping
score < 15). Dangling end reads, same fragment reads, self-circled reads,
self-ligation reads and other invalid Hi-C reads were also discarded.
Details about data quality are summarized in Supplementary Table
1. After removing duplicates, reads were used to generate raw Hi-C
matrices at 10 kb, 40 kb, 100 kb and 200 kb resolution using hicBuildMa-
trix. In the raw contact matrix, rows and columns with zero or small total
counts were removed because these bins were mostly from repetitive
regions. After filtering low-count bins, the matrices were corrected by
the ICE method” using hicCorrectMatrix. We also converted valid read
pairs into .hic format files with the juicer tool®? pre command and used
these .hic files for heat map visualization.
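
The low-count filtering and iterative correction steps can be illustrated with a short NumPy sketch. This is a simplified stand-in written for illustration, not the hicCorrectMatrix implementation; the function name `ice_normalize` and the `min_count` threshold are assumptions, and the sketch assumes every retained bin still has non-zero coverage.

```python
import numpy as np


def ice_normalize(raw, min_count=1, n_iter=200, tol=1e-6):
    """Simplified iterative correction of a symmetric contact matrix.

    Bins whose total count is below min_count are removed first; the
    remaining matrix is then rescaled until all bins have equal coverage.
    Returns the balanced matrix, the per-bin bias and the kept-bin mask.
    """
    m = np.array(raw, dtype=float)
    keep = m.sum(axis=0) >= min_count          # filter empty / low-count bins
    m = m[np.ix_(keep, keep)]
    bias = np.ones(m.shape[0])
    for _ in range(n_iter):
        cov = m.sum(axis=0)
        cov = cov / cov.mean()                 # relative coverage of each bin
        m = m / np.outer(cov, cov)             # divide out row and column bias
        bias *= cov
        if np.abs(cov - 1).max() < tol:        # coverage was already uniform
            break
    return m, bias, keep


# Toy 4-bin matrix; the last bin has no reads and is filtered out
raw = np.array([[10, 4, 2, 0],
                [4, 20, 6, 0],
                [2, 6, 30, 0],
                [0, 0, 0, 0]])
balanced, bias, keep = ice_normalize(raw)
print(balanced.sum(axis=0))                    # equal coverage for all kept bins
```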

In addition, to avoid the influence of read depth on data interpretation,
we also randomly sampled an equal number of reads for each stage in the analysis
when mentioned.


Reproducibility score 

The reproducibility score was calculated with GenomeDISCO software”. It
uses random walks on the contact map graph for smoothing before
comparing the interaction maps, resulting in a reproducibility score
that can be used for quality control of biological replicates. We calcu-
lated the GenomeDISCO reproducibility score at 200-kb resolution for
biological replicates.


TAD and TAD boundary identification 

TADs and TAD boundaries (immature and mature TAD boundaries) 
were identified based on public TAD separation score method** with 
hicFindTADs command in HiCExplorer suite (v.2.1)2°"". The TAD sepa- 
ration scores in human embryos were calculated at 40 kb resolution 
using parameters ‘--minDepth 300000 --maxDepth 3000000 --step
300000 --minBoundaryDistance 400000 --thresholdComparisons
0.01 --correctForMultipleTesting fdr --delta 0.01’.

The principle for TAD separation score and boundary detection is
based on a graph-clustering measure called the conductance of a cut. The con-
ductance of a cut relates the interactions between two clus-
ters to the interactions within the clusters. It allows the identification
of good clusterings, characterized by few interactions between clusters,
which is similar to the defining feature of TADs. Thus, TADs can be regarded as graph
clusters of consecutive nodes (bins), and domain boundaries can be
regarded as genomic positions that best separate two clusters (TADs).

The hicFindTADs command first transformed the Hi-C contact matrix
into a z-score matrix $A = (a_{ij})$. In the z-score matrix, each contact frequency
is transformed into a z-score from the input contact matrix based on
the distribution of all contacts at the same genomic distance. For an
arbitrary bin $l$, the contacts between an upstream and a downstream
region of length $w$ are all included in the submatrix $A[\alpha_l, \beta_l]$ with row index
$\alpha_l \in \{l-w, \ldots, l\}$ and column index $\beta_l \in \{l, \ldots, l+w\}$. For each matrix bin,
we compute the TAD separation score $(l, w)$ as the mean value of $A[\alpha_l, \beta_l]$.
To reduce noise and improve robustness, a multi-scale version of the
TAD-separation score is computed for different values of $w$ that are
averaged per bin from 300,000 to 30,000,000 with step 300,000 in
an exponential size-growth way (minDepth + (step × int(x)^{1.5}) for x in
[0,1,2,...]). Genomic bins with a low TAD-separation score (local minima)
are indicative of boundaries and stronger boundaries will have lower
scores. For a local-minimum bin $i$, $i-v$ and $i+v$ are the bins at distance
$v$ upstream and downstream of $i$, respectively. We use the Wilcoxon
rank-sum test to compare the values of $A[\alpha_i, \beta_i]$ with the values of the
other two submatrices $A[\alpha_{i-v}, \beta_{i-v}]$ and $A[\alpha_{i+v}, \beta_{i+v}]$, respectively.
The higher of the two P values is used. We corrected the P values using
the false discovery rate (FDR) method, and regions with q < 0.01 were
reported as boundaries. TADs were identified as regions between two
boundaries.
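
The z-score transformation and the per-bin separation score can be sketched as follows. This is an illustrative NumPy reading of the description in this section, not the HiCExplorer source code: the function names are hypothetical, windows are given in bins rather than base pairs, and the exponent 1.5 in `window_sizes` is taken from the size-growth formula as read from the text.

```python
import numpy as np


def zscore_by_distance(mat):
    """Z-score each contact against all contacts at the same genomic distance."""
    mat = np.asarray(mat, dtype=float)
    n = mat.shape[0]
    z = np.zeros_like(mat)
    for d in range(n):
        diag = np.diagonal(mat, d)
        sd = diag.std()
        vals = (diag - diag.mean()) / sd if sd > 0 else np.zeros(n - d)
        idx = np.arange(n - d)
        z[idx, idx + d] = vals
        z[idx + d, idx] = vals
    return z


def tad_separation_score(z, l, w):
    """Mean of the submatrix with rows l-w..l and columns l..l+w (in bins)."""
    lo, hi = max(l - w, 0), min(l + w, z.shape[0] - 1)
    return z[lo:l + 1, l:hi + 1].mean()


def window_sizes(min_depth, max_depth, step):
    """Window sizes min_depth + step * x**1.5 for x = 0, 1, 2, ... below max_depth."""
    sizes, x = [], 0
    while min_depth + int(step * x ** 1.5) < max_depth:
        sizes.append(min_depth + int(step * x ** 1.5))
        x += 1
    return sizes


# Toy map with two 6-bin domains: strong contacts within, weak contacts between
toy = np.ones((12, 12))
toy[:6, :6] = 10
toy[6:, 6:] = 10
z = zscore_by_distance(toy)
inside = tad_separation_score(z, 3, 3)      # bin in the middle of a domain
boundary = tad_separation_score(z, 6, 3)    # bin at the domain border
sizes = window_sizes(300_000, 3_000_000, 300_000)
print(inside, boundary, sizes)
```

In the toy map the score is positive inside a domain and negative at the border, which is the local-minimum behaviour that the boundary call relies on.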


Directional index and insulation score 
Directional index’ can quantify the degree of upstream or downstream 
bias of a given bin. We calculated it for each bin based on the ICE- 
normalized and depth-normalized matrix as previously described®. 
Insulation score was calculated as previously described” with the 
public code on GitHub (matrix2insulation.pl; https://github.com/dekkerlab/giorgetti-nature-2016).
A sliding 480-kb × 480-kb square
along the matrix diagonal was used. The IQR mean signal within the
square was then assigned to each 40-kb diagonal bin. This proce-
dure was then repeated for all 480-kb diagonal bins. The insulation
score was normalized relative to all of the insulation scores across each
chromosome by calculating the log2 ratio of each bin’s insulation score
versus the mean of all insulation scores. Valleys or minima along the 
normalized insulation score vector represent the loci of reduced Hi-C 
interactions that occur across the bin. These valleys or minima are 
interpreted as areas of high local insulation. 
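
A minimal sketch of the sliding-square calculation and log2 normalization is given below. It is not the matrix2insulation.pl code: the function name `insulation_profile` is hypothetical, a plain mean is used in place of the IQR mean as a simplification, and the square size is given in bins (480 kb corresponds to 12 bins at 40-kb resolution).

```python
import numpy as np


def insulation_profile(mat, square_bins):
    """Log2-normalized insulation score along the diagonal.

    For bin i the raw score is the mean contact between the square_bins bins
    upstream and the square_bins bins downstream of i. Each raw score is then
    divided by the mean of all raw scores and log2-transformed.
    """
    mat = np.asarray(mat, dtype=float)
    n = mat.shape[0]
    raw = np.full(n, np.nan)                   # edges stay undefined
    for i in range(square_bins, n - square_bins):
        raw[i] = mat[i - square_bins:i, i + 1:i + 1 + square_bins].mean()
    return np.log2(raw / np.nanmean(raw))


# Toy map with two 6-bin domains
toy_map = np.ones((12, 12))
toy_map[:6, :6] = 10
toy_map[6:, 6:] = 10
profile = insulation_profile(toy_map, square_bins=2)
print(np.round(profile, 2))
```

The valley of the profile falls on the bins flanking the domain border, matching the interpretation of minima as sites of high local insulation.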


TAD structure strength 

TAD structure strength was quantified as the ratio of the interactions 
within-TAD over the between-TAD intensity as previously described”. 
For a 90 × 90 contact enrichment map, the within-TAD interaction was
the sum of the central square of the enrichment map, rows 30 to 59 and
columns 30 to 59 ([30:60, 30:60]), and the between-TAD intensity was
half of the sum of the upstream and downstream neighbouring blocks,
[0:30, 30:60] and [30:60, 60:90]. TAD
structure strength was calculated both for individual replicates and for the merged data.
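
The ratio can be written directly from the index ranges given in this section. The sketch below is illustrative; the function name `tad_strength` is hypothetical and the input is assumed to be an already aggregated 90 × 90 enrichment map.

```python
import numpy as np


def tad_strength(enrichment):
    """Within-TAD sum divided by half the sum of the two flanking blocks."""
    e = np.asarray(enrichment, dtype=float)
    if e.shape != (90, 90):
        raise ValueError("expected a 90 x 90 enrichment map")
    within = e[30:60, 30:60].sum()
    between = 0.5 * (e[0:30, 30:60].sum() + e[30:60, 60:90].sum())
    return within / between


# Toy map: twofold enrichment inside the central (TAD) square
toy_enrichment = np.ones((90, 90))
toy_enrichment[30:60, 30:60] = 2.0
strength = tad_strength(toy_enrichment)
print(strength)
```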


TAD signal variance 

TAD signal variance can indicate the strength of the TAD structure and
was calculated as previously described*’. We used intra-chromosomal maps
at 40-kb resolution for each embryo stage with equal reads. In brief,
TAD signal was calculated as the log2 ratio of the number of corrected
upstream-to-downstream interactions within a 2-Mb window in each
chromosome. Regions which contained fewer than 10 counts or gaps
within a 2-Mb distance were filtered out. The variance was calculated 
for individual chromosomes. Comparisons between different stages 
were performed only for the bins shared by all of the different stages. 
Relative TAD signal variance was calculated relative to the minimum 
TAD signal variance. We also performed a downsampling analysis for 
human sperm and human blastocysts. Variance due to sampling noise 
was confirmed to be proportional to 1/reads. The Wilcoxon rank-sum 
test was applied to test statistical significance. 


Hi-C data mapping for mixed sample 

After sequencing for the mixed samples, the human reads and mouse 
reads were separated from the mixture by using human genome 
reference and mouse genome reference. We mapped the mixed sample 
data to the human hg19 genome and the mouse mm10 genome separately, using bwa mem
with parameters ‘-E50 -L0’. Separated data for human and
mouse were processed as mentioned in the section ‘Hi-C data process- 
ing’. In addition, we also combined the hg19 and mm10 genome together 
as reference genome. We mapped the reads to the combined reference 
genome. Those reads, with one end mapped to human and the other 
mapped to mouse genome, were all discarded. 
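
The removal of human-mouse chimeric read pairs after mapping to the combined reference can be sketched as below. The chromosome-name prefixes `hg19_` and `mm10_` and the function names are assumptions made for this illustration; the text does not state how chromosomes were labelled in the combined reference.

```python
def species_of(chrom, prefixes):
    """Return the species whose (hypothetical) name prefix matches chrom."""
    for species, prefix in prefixes.items():
        if chrom.startswith(prefix):
            return species
    raise ValueError(f"unrecognized chromosome name: {chrom}")


def split_pairs(pairs, prefixes=None):
    """Split read pairs by species and count discarded chimeric pairs.

    Each pair is (chrom1, pos1, chrom2, pos2) on the combined reference.
    Pairs whose two ends map to different species are dropped.
    """
    prefixes = prefixes or {"human": "hg19_", "mouse": "mm10_"}
    kept = {species: [] for species in prefixes}
    n_chimeric = 0
    for chrom1, pos1, chrom2, pos2 in pairs:
        s1, s2 = species_of(chrom1, prefixes), species_of(chrom2, prefixes)
        if s1 == s2:
            kept[s1].append((chrom1, pos1, chrom2, pos2))
        else:
            n_chimeric += 1
    return kept, n_chimeric


example_pairs = [
    ("hg19_chr1", 1_000, "hg19_chr1", 50_000),
    ("mm10_chr2", 2_000, "mm10_chr2", 90_000),
    ("hg19_chr3", 3_000, "mm10_chr5", 10_000),   # chimeric, discarded
]
kept, n_chimeric = split_pairs(example_pairs)
print(len(kept["human"]), len(kept["mouse"]), n_chimeric)
```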


A/B compartmentalization, compartment strength and A/B 
compartment dynamics 

PC1 values from principal component analysis on correlation heat 
maps have been used to indicate the A/B compartment status!°. We 
used HOMER” software with parameters ‘-res 100,000, -superRes 
400,000’ to obtain the PC1, PC2 and PC3 value and parameters ‘-res 
500,000, -superRes 500,000’ for equal read depth samples. Because 
sometimes PC2 or PC3 values reflect A/B compartments, we manually
checked the PC1, PC2 and PC3 tracks against gene density and the plaid
pattern in the correlation heat maps along each chromosome to obtain the
final ‘PC1’ list. The direction of the eigen (PC) values is arbitrary, and
therefore positive values were set to ‘A’ and negative values were set 
to ‘B’ based on their association with gene density. 

Compartment strength was defined as $\ln(AA \times BB / AB^{2})$
according to the published description”. Enrichment contact
maps of A/B interactions in Hi-C data were calculated at 500-kb resolu-
tion. A GC content profile was separated into 5 bins: minimum to 20th
percentile, 20th percentile to 40th percentile and so on. For each pixel
of the 5 × 5 matrix, O/E values were then calculated for loci belonging to
each pair of bins as contact enrichment. GC content is known to highly
correlate with the compartment profile. Thus GC content was used for
compartment strength instead of PC1 values. We used the top 20% of
GC content for A, and the bottom 20% GC content for B. For the error
bar in evaluating the compartment strength in Fig. 1d, we obtained
100 5 × 5 compartment enrichment matrices by bootstrapping. For
each pixel of the 5 × 5 compartment enrichment map, we took all the
observed-over-expected values that contributed to this pixel and took
a random sample with replacement of the same size as the contribut-
ing values. We then proceeded with the downstream analysis for each of the 100
resampled maps.
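
The compartment-strength formula and the per-pixel bootstrap can be sketched as follows. The function names are hypothetical, and two details are assumptions of this illustration rather than statements from the text: each pixel of the enrichment map is taken to be the mean of its contributing O/E values, and row/column 0 is taken to be the lowest GC quintile (B) with row/column 4 the highest (A).

```python
import numpy as np


def compartment_strength(enrich):
    """ln(AA * BB / AB^2) from a 5 x 5 GC-quintile enrichment map."""
    e = np.asarray(enrich, dtype=float)
    aa, bb, ab = e[4, 4], e[0, 0], e[0, 4]
    return np.log(aa * bb / ab ** 2)


def bootstrap_strength(pixel_values, n_boot=100, seed=0):
    """Resample the O/E values behind every pixel and recompute the strength.

    pixel_values[i][j] is a 1-D array of the O/E values contributing to
    pixel (i, j); each is resampled with replacement at its own size.
    """
    rng = np.random.default_rng(seed)
    strengths = []
    for _ in range(n_boot):
        e = np.empty((5, 5))
        for i in range(5):
            for j in range(5):
                v = np.asarray(pixel_values[i][j], dtype=float)
                e[i, j] = rng.choice(v, size=v.size, replace=True).mean()
        strengths.append(compartment_strength(e))
    return np.array(strengths)


# Toy map: AA = BB = 2, AB = 0.5, everything else 1
toy_e = np.ones((5, 5))
toy_e[4, 4] = toy_e[0, 0] = 2.0
toy_e[0, 4] = toy_e[4, 0] = 0.5
toy_pixels = [[np.full(10, toy_e[i, j]) for j in range(5)] for i in range(5)]
point = compartment_strength(toy_e)
boot = bootstrap_strength(toy_pixels)
print(point, boot.std())
```

The spread of the bootstrap values (for example their standard deviation) can then serve as the error bar.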

We identified regions of A/B compartment dynamics as those with statistically
significant variability in PC1 values across developmental stages
using analysis of variance with adjusted P value < 0.05. Genes with A/B
compartment switches were analysed for Gene Ontology (GO) enrich-
ment with DAVID 6.8 (with parameters count ≥ 3; EASE < 0.05).


Density of paired loci at different interaction insert sizes 

We examined the relative abundance of paired loci at different insert 
sizes in human sperm and mouse sperm as previously described®. We
assigned the unique paired reads to restriction enzyme fragments and
plotted the distribution of the log10 of insert size between fragments.


Contact probability 

Contact probability was calculated as previously described and
implemented with the command line tool ‘pairsqc.py’
(https://github.com/4dn-dcic/pairsqc). It first divides distances into
logarithmically spaced bins at log10 scale at an interval of 0.1 (growing
by about 1.25-fold). For each bin, contact probability is computed as
number_of_reads/number_of_possible_reads/bin_size. Number_of_reads is the
number of interactions at corresponding distances.
Number_of_possible_reads is computed as the sum of L_chr − S_mid − 1 over
all chromosomes, where L_chr is the length of a chromosome. S_mid is the
midpoint of the bin at log10 scale (bin 10**5.0 ~ 10**5.1 has midpoint
10**5.05). Bin_size is computed as max distance − min distance
(for example, for bin 10**5.0 ~ 10**5.1, the bin size is 10**5.1 − 10**5.0).
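
As a rough illustration of the formula above, the following Python sketch bins contact distances at log10 intervals of 0.1 and divides each count by the number of possible reads and the bin size. It is not the pairsqc implementation. The function name, the default distance range and the clamping of negative chromosome terms to zero are assumptions made for this example.

```python
import math


def contact_probability(distances, chrom_lengths,
                        log_min=3.0, log_max=8.0, step=0.1):
    """Return (log10 lower edge, log10 upper edge, probability) per bin."""
    n_bins = int(round((log_max - log_min) / step))
    counts = [0] * n_bins
    for d in distances:
        if d <= 0:
            continue
        idx = int(math.floor((math.log10(d) - log_min) / step))
        if 0 <= idx < n_bins:
            counts[idx] += 1

    result = []
    for i, n_reads in enumerate(counts):
        lo = log_min + i * step
        hi = lo + step
        s_mid = 10 ** ((lo + hi) / 2)  # bin midpoint on the log10 scale
        # Sum of L_chr - S_mid - 1 over chromosomes; chromosomes shorter
        # than the midpoint contribute nothing (assumption of this sketch).
        possible = sum(max(length - s_mid - 1, 0) for length in chrom_lengths)
        bin_size = 10 ** hi - 10 ** lo  # max distance - min distance
        prob = n_reads / possible / bin_size if possible > 0 else float("nan")
        result.append((lo, hi, prob))
    return result
```

Note that 10**0.1 is about 1.26, which is the bin growth factor quoted in the text.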


Region comparison between human genome and mouse genome 
For the region comparison between human and mouse, region coordinates
were converted between the hg19 and mm10 genomes using the online UCSC
liftOver tool with default parameters. For ZGA boundary overlap
calculation, we lifted the mouse ZGA boundaries to the human hg19 genome.


Identification of stage-specific gained boundaries 

To analyse insulated boundaries between embryo stages, we first
merged boundaries within 200 kb of each other across all examined
embryo stages into a common boundary reference (200-kb collapsed
boundaries). This increased the robustness of boundary comparison.
For stage-specific gained boundaries, we selected those boundaries
that did not exist in previous stages, first emerged in the indicated
stage and persisted until the blastocyst stage. For example, human
8-cell gained boundaries are boundaries that do not function as
insulated boundaries in the 2-cell stage, but can be observed with
evident insulation at the 8-cell stage.
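
The selection logic described above can be sketched in Python as follows. This is an illustrative sketch, not the authors' pipeline. It assumes single-linkage merging of boundary positions on one chromosome, and it assumes that the stage list is ordered and ends at the blastocyst; the function names and data layout are hypothetical.

```python
def collapse_boundaries(positions, max_gap=200_000):
    """Merge boundary positions that lie within max_gap of the previous one.

    Returns (start, end) intervals for one chromosome.
    """
    clusters = []
    for pos in sorted(positions):
        if clusters and pos - clusters[-1][-1] <= max_gap:
            clusters[-1].append(pos)
        else:
            clusters.append([pos])
    return [(c[0], c[-1]) for c in clusters]


def gained_at(stage, stages, presence):
    """Return reference boundaries that first appear at `stage` and persist.

    stages:   ordered list of stage names, ending at the blastocyst.
    presence: dict mapping stage name -> set of reference boundary ids
              that show insulation at that stage.
    """
    i = stages.index(stage)
    earlier = set().union(*(presence[s] for s in stages[:i])) if i else set()
    persistent = set.intersection(*(presence[s] for s in stages[i:]))
    return persistent - earlier
```

For instance, with stages 2-cell, 8-cell, morula and blastocyst, a boundary present only from the 8-cell stage onwards is reported for the 8-cell stage, whereas one that disappears in morulae is not.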


Enrichment analysis of repeats on stage-specific gained 
boundaries 

The repeat frequency in the genome was calculated for every 1-kb bin
with the bedtools coverage command. To generate the enrichment plots
for repeats, the midpoint of each stage-specific gained boundary region
was identified, and repeat frequency was calculated in 25-kb bins for
±1 Mb around the boundary midpoint.
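
A minimal Python sketch of such an enrichment profile is given below. It assumes a window of 1 Mb on either side of each boundary midpoint, counts repeat start positions rather than base-pair coverage, and averages over boundaries; these simplifications and the function name are assumptions of the example, not the authors' implementation.

```python
def repeat_profile(midpoints, repeat_starts, flank=1_000_000, bin_size=25_000):
    """Average number of repeats per 25-kb bin around boundary midpoints.

    midpoints:     boundary midpoint coordinates on one chromosome.
    repeat_starts: start coordinates of repeats on the same chromosome.
    """
    if not midpoints:
        raise ValueError("at least one boundary midpoint is required")
    n_bins = 2 * flank // bin_size
    totals = [0] * n_bins
    for mid in midpoints:
        window_start = mid - flank
        for start in repeat_starts:
            offset = start - window_start
            if 0 <= offset < 2 * flank:
                totals[offset // bin_size] += 1
    return [t / len(midpoints) for t in totals]
```

With the default settings the profile has 80 bins, and the boundary midpoint falls at the left edge of bin 40.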


FIREs calling 

FIREs calling was based on a previously described method. For each
40-kb bin, the raw count was calculated as the total number of chromatin
interactions within 200 kb. A normalization pipeline, ‘HiCNormCis’, was
applied to normalize the raw count. A Poisson regression model was fitted
for each 40-kb bin, taking the raw count as the outcome variable and
three local genomic features (effective fragment length, GC content and
mappability score) as the covariates. The residuals from the Poisson
regression model were used as the HiCNormCis output. Next, for each
40-kb bin, a z-score was calculated based on the HiCNormCis output.
FIREs are defined as bins with a one-sided P value less than 0.05,
corresponding to −ln(P) greater than 3. −ln(P) was chosen as the FIRE score.
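
The last two steps (z-score to one-sided P value to FIRE score) can be sketched in Python as below. This is a hedged illustration: it assumes the z-scores are referred to a standard normal distribution with an upper-tail test, and that the sample standard deviation is used; the function name is hypothetical and the HiCNormCis regression itself is not reproduced.

```python
import math
import sys


def fire_scores(normalized_counts, alpha=0.05):
    """Return (z, p, score, is_fire) for each normalized bin value."""
    n = len(normalized_counts)
    mean = sum(normalized_counts) / n
    variance = sum((x - mean) ** 2 for x in normalized_counts) / (n - 1)
    sd = math.sqrt(variance)
    if sd == 0:
        raise ValueError("all values are identical; z-scores are undefined")

    results = []
    for x in normalized_counts:
        z = (x - mean) / sd
        # Upper-tail probability of a standard normal variable.
        p = 0.5 * math.erfc(z / math.sqrt(2))
        p = max(p, sys.float_info.min)  # avoid log(0) for extreme z
        score = -math.log(p)            # FIRE score, -ln(P)
        results.append((z, p, score, p < alpha))
    return results
```

Because −ln(0.05) is approximately 2.996, the P < 0.05 threshold is essentially the same as requiring a FIRE score above 3.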


DNA methylation data analysis 
The human early embryo and sperm DNA-methylation data were downloaded
from CRA000114, which had been deposited in the Genome Sequence Archive
(GSA). Low-quality reads were removed by Trimmomatic v.0.32. The
filtered reads were aligned to the hg19 genome by Bismark with default
parameters. PCR duplicates were removed by Picard and overlapped parts
of paired-end reads were trimmed from one end by bamUtil. The
methylation level of each CpG site was calculated by a custom script.


DNase-seq data analysis and ATAC-seq data analysis 

DNase-seq data were analysed as previously described. Low-quality
reads were removed and the remaining reads were cropped to 100 bp
by Trimmomatic v.0.32. Read 1 of the paired reads and unpaired reads
were used for mapping to the human genome hg19 with Bowtie v.1.2.0 with
parameter ‘-m 1’. Low mapping quality (MAPQ < 10) and PCR-duplicated
reads were removed by SAMtools and Picard. DHSs were called by the
hotspot algorithm with FDR < 0.01. DHSs covered by fewer than 8 reads
were filtered out. DNase-seq signal tracks for visualization were
generated by bamCoverage in the deepTools2 suite. ATAC-seq data were
downloaded from GEO (GSE101571) and analysed.


RNA-seq data processing 

Human early embryo RNA-seq raw data were downloaded from
GSE36552. We used RefSeq genes from the UCSC genome browser and
Ensembl genes as gene annotation. Housekeeping gene annotations were
downloaded from a previous publication. Low-quality RNA-seq reads
and adaptor sequences were removed by Trimmomatic v.0.32. Then
reads were mapped to the hg19 genome by STAR (v.2.5.2b) with default
parameters. Gene expression FPKM was calculated by HOMER software.
The ZGA genes were defined as those clearly upregulated in human
8-cell embryos (FPKM > 1 in 8-cell embryos and fold change > 3 compared
with 2-cell embryos). To analyse the expression of Alu repeats in
human early embryos, the total RNA-seq data were downloaded from
GSE85632. The reference repeat annotation was downloaded from
rmsk in the UCSC Table Browser. The FPKM value of each repeat or
transposon was calculated by HOMER analyzeRepeats.pl with parameter
‘-fpkm’ and plotted as tracks in Extended Data Fig. 7j.
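
The ZGA gene criterion stated above (FPKM > 1 in 8-cell embryos and fold change > 3 relative to 2-cell embryos) can be expressed as a short Python filter. This is a sketch under assumptions: the small pseudocount used to avoid division by zero, the function name and the dictionary inputs are illustrative choices that are not described in the text.

```python
def zga_genes(fpkm_2cell, fpkm_8cell, min_fpkm=1.0, min_fold=3.0,
              pseudocount=1e-6):
    """Return gene names passing the FPKM and fold-change thresholds.

    fpkm_2cell, fpkm_8cell: dicts mapping gene name -> FPKM value.
    """
    selected = []
    for gene, expr8 in fpkm_8cell.items():
        expr2 = fpkm_2cell.get(gene, 0.0)
        # Pseudocount keeps the ratio finite when the 2-cell FPKM is zero.
        fold = (expr8 + pseudocount) / (expr2 + pseudocount)
        if expr8 > min_fpkm and fold > min_fold:
            selected.append(gene)
    return sorted(selected)
```

A gene with FPKM 5 at the 8-cell stage and 0.5 at the 2-cell stage passes both thresholds, whereas one that only doubles does not.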

To find differentially expressed genes between α-amanitin 8-cell and
control 8-cell embryos, or between siCTCF-treated and siControl morulae,
the HOMER command getDiffExpression.pl with -DESeq2 was
used. FDR < 0.05 and |fold change| > 2 were used as cut-offs for
significantly differentially expressed genes.
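
The cut-offs above can be applied to a differential-expression table as in the following Python sketch. It assumes that fold changes are supplied on a log2 scale (so |fold change| > 2 becomes |log2 fold change| > 1) and that each row is a (gene, log2 fold change, FDR) tuple; this layout and the function name are hypothetical and do not reflect the HOMER output format.

```python
import math


def split_significant(rows, max_fdr=0.05, min_abs_fold=2.0):
    """Split genes into up- and downregulated lists using both cut-offs."""
    threshold = math.log2(min_abs_fold)
    up, down = [], []
    for gene, log2_fold, fdr in rows:
        if fdr >= max_fdr or abs(log2_fold) <= threshold:
            continue  # fails the FDR or the fold-change cut-off
        (up if log2_fold > 0 else down).append(gene)
    return up, down
```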

RNA-seq tracks for visualization were generated by the bamCoverage
program in deepTools2 with parameter ‘--normalizeUsingRPKM’.


Statistics and reproducibility 

R and Prism were used for statistical analysis. The Wilcoxon rank-sum
test was used for statistical significance (Figs. 1b, 4b, Extended Data
Figs. 2i, 4b, left, 4d-f, 7b, 8a, 9e, 10d, e) with the wilcox.test function
in R. The χ² test was applied for the statistical significance of the
overlap between human ZGA boundaries and mouse ZGA boundaries relative
to random (Fig. 3c) and the enrichment of FIREs in A compartments
(Extended Data Fig. 10c). The two-sided Kolmogorov-Smirnov test was
used to compare the distance distribution of stage-gained insulated
boundaries to the closest housekeeping genes in Fig. 3d and Extended
Data Fig. 7a. A one-sided t-test was used to calculate statistical
significance for A/B compartment strength (Fig. 1d) and differential
FIRE score (siCTCF − control) in FIREs and non-FIREs (Extended Data
Fig. 10f). Fisher’s exact test was used for GO enrichment analysis by the
DAVID tool (v.6.8) with P < 0.05 cut-off in Extended Data Fig. 4b. For box
plots, boxes represent the 25th, 50th and 75th percentiles and whiskers
show 1.5× the interquartile range. *P < 0.05, **P < 0.01, ***P < 0.001;
NS, P > 0.05.


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Data generated for this study have been deposited to the Genome
Sequence Archive with the accession numbers CRA000852 and
CRA000108. Hi-C data of unmixed human sperm sample have been
deposited in CRA000108, and the other data have been deposited
in CRA000832. Raw image files used in the figures that support the
findings of this study are available from the corresponding authors 
upon reasonable request. 


Code availability 


Codes used for the analysis reported in this study are available at 
https://github.com/ChenXP0310/2019-humanembryo3D. 
Extended Data Fig. 1 | Validation of optimized ultra-low-input Hi-C.

a, Interaction heat maps for mouse embryonic day (E)3.5 embryos. Top, pooled
mouse E3.5 embryo Hi-C. Bottom, a single mouse E3.5 embryo with ultra-low-
input Hi-C from this study. b, A track snapshot of PC1 values for chromosome 1
in pooled mouse E3.5 embryos and a single mouse E3.5 embryo along with public
mouse ES cell (mESC) data. c, Interaction heat maps for pooled mouse E3.5
embryos and a single mouse E3.5 embryo for a zoomed-in region from
chromosome 1 overlaid with TADs, directional index, boundaries and TAD
separation scores. d, GenomeDISCO reproducibility (Methods) among the
Hi-C data for each chromosome in pooled human blastocysts and two human
single-blastocyst replicates. e, The strength of average TADs in pooled human
blastocyst data, single human blastocyst data and metaphase HeLa data. TAD
positions are annotated in combined human blastocyst data. TAD structure
strength (Methods) is shown on each corresponding panel. f, GenomeDISCO
reproducibility for biological replicates of human sperm and embryos. n = 2-3
independent biological replicates for each stage. Each dot represents one
comparison between two biological replicates. Data are mean ± s.d.


Extended Data Fig. 2 | TAD establishment during human embryonic
development. a, Interaction heat maps for human embryos with equal reads
along with human H1 ES cells. b, An example of TAD calling using the TAD
separation score method in human blastocysts with different read depths. The
orange track under each heat map represents TAD domains. The bottom panels
are TAD boundaries and TAD separation score tracks. M, million reads. c, Venn
diagram for the overlap of TAD boundaries of human blastocyst between 10
million read depth data (10 M, blue) and total combined data (total, red). d, An
equal-read-depth TAD signal track snapshot for human sperm and embryos
along with human H1 ES cells. e, Average directional index around human TAD
boundaries (0.5 Mb) at different embryonic stages. Directional index
generated by a random blastocyst valid read pair dataset is also shown as a
control. f, g, Interaction heat map examples at 50-kb resolution for unmixed
human embryos and human embryos from mixed samples. h, The strength of
average TADs in mouse morula embryos from mixed samples (six mouse
morulae for each mixed sample), unmixed mouse morula and mouse MII
oocyte. TAD positions are annotated in combined mouse E3.5 embryos. TAD
structure strength is shown on each corresponding panel. i, Relative variance
of TAD signal for replicates of human embryos including human embryos from
mixed samples. We used the chromosome as the unit for this analysis. Two-cell
(n = 3), 8-cell (n = 3), morula (n = 3), blastocyst (n = 3) and 6-week embryos (n = 2).
Boxes represent the 25th, 50th and 75th percentiles and whiskers show 1.5× the
interquartile range. ***adjusted P < 0.001 for all pairwise comparisons
between two stage samples (two-sided Wilcoxon rank-sum test with
Benjamini-Hochberg multiple testing correction). NS, not significant.
Extended Data Fig. 3 | A/B compartmentalization dynamics during human
embryonic development. a, Pearson correlation heat maps of chromosome 1
at 500-kb resolution in human 2-cell, 8-cell, morula and blastocysts, 6-week
embryos and H1 ES cells (equal number of reads generated from 2-3 biologically
independent replicates for each stage). b, Top, whole-genome PC1 value circle
plots for human 2-cell embryos and human blastocysts (with R package
circlize). Bottom, PC1 tracks in mouse zygotes and mouse E3.5 embryos.
c, Pearson correlation heat maps at 500-kb resolution of chromosome 1 in
mouse zygotes and early embryos (equal number of reads generated from
2-3 biologically independent replicates for each stage). d, e, Five-by-five
contact enrichment maps of A-B compartments averaged over genomic positions
using GC content in two previously published mouse early embryo Hi-C
datasets (d and e). f, PC1 value distribution in human embryos and mitotic
HeLa cells.
Extended Data Fig. 4 | A/B compartment switches during human embryonic
development. a, The percentage of genomic regions with A/B compartment
switches from morula to 6-week stage, and the percentage of single A/B
switched regions relative to total switched regions. b, Left, the box plot
illustrates the gene expression dynamics with A/B compartment switches
between human blastocysts and 6-week embryos (biological replicates pooled;
blastocyst, n = 3; 6-week embryo, n = 2). P values are shown (one-sided Wilcoxon
rank-sum test). Boxes represent the 25th, 50th and 75th percentiles and
whiskers show 1.5× the interquartile range (b, d-f). Middle, PC1 heat map for
A/B compartment switched regions. Heat map is sorted according to A/B
compartment switched PC1 values. Right, GO enrichment for genes with A/B
compartment switches between blastocysts and 6-week embryos using DAVID
v.6.8. c, A/B compartment status around the NEFM-NEFL locus in human
blastocysts and 6-week embryos overlaid with gene expression.
d, Unmethylated CpG (methylation level < 0.25) density within A or B
compartments in human sperm and embryos (biological replicates pooled;
sperm, n = 3; morula, n = 2; blastocyst, n = 3; 6-week embryos, n = 2). P values
are shown (one-sided Wilcoxon rank-sum test). e, Reduced CpG methylation
level (ML) (ML_sperm − ML_blastocyst) in A compartments and B compartments
of human blastocyst (biological replicates pooled; n = 3). P value is calculated
by one-sided Wilcoxon rank-sum test. f, Reduced methylation level
(ML_sperm − ML_blastocyst) for regions with A-to-A, A-to-B, B-to-A, and B-to-B
compartment switches from human sperm to blastocysts (replicates
pooled; n = 3). ***P < 0.001 for comparisons between ‘A-to-A’ and other groups
(two-sided Wilcoxon rank-sum test). Right, randomly shuffled A/B
compartment switched regions as control.
Extended Data Fig. 5 | Human sperm have no typical TADs. a, Interaction heat
maps around the Hoxa cluster in mouse sperm and mouse E3.5 embryos.
b, Density plot of interaction insert size for human sperm, human blastocysts
and mouse sperm (Methods). c, Contact probability decay curve for human
sperm, mouse sperm and metaphase HeLa cells. d, e, Examples of interaction
heat maps at human and mouse conserved syntenic regions from unmixed
sperm and mixed human and mouse sperm sample. Interaction heat maps in d
(right) and e (right) are for HeLa and HT22 somatic cell mixed samples,
respectively. f, Pearson correlation heat maps of chromosome 2 at 500-kb
resolution in human sperm. Left, unmixed human sperm Hi-C. Right, human
Extended Data Fig. 6 | Insulated boundary dynamics in human embryos and
mouse embryos. a, An example of insulated boundaries in human embryos, as
indicated by yellow arrows at the 2-cell stage. Dark coloured regions in TAD
separation score heat maps indicate strong insulation. The bottom blue bars
are insulated
at insulated boundaries. a, Cumulative distribution function plot for the
distance of stage-specific gained boundaries to the closest housekeeping 
genes in mouse embryos® (pooled data from 2-3 biological replicates). The 
dash line marks a distance of 200 kb; 2-cell versus 4-cell, P=1.25 x 107°; 2-cell 
versus 8-cell, P=1.59 x 10" (two-sided Kolmogorov-Smirnov test). b, Gene 
expression (expression data from ref. 7”; 8-cell, n=20; morula, n=16; blastocyst, 
n=30) for housekeeping genes within 200 kb of the stage-specific gained 
boundaries and the other housekeeping genes at the stages when insulated 
boundaries are being formed. Boxes represent the 25th, 50th and 75th
percentiles and whiskers show 1.5× the interquartile range. P values are shown
(two-sided Wilcoxon rank-sum test). c–e, Enrichment analysis of repeat
elements at stage-specific gained insulated boundaries in human embryos. 


f-h, Enrichment analysis of repeat elements at stage-specific gained insulated 
boundaries in mouse embryos. Enrichment analysis of Alu repeats (f), LINE 
repeats (g) and MIR repeats (h). i, Examples of the earlier stage gained 
insulated boundary locating around Alu dense regions in human and mouse. 
j, Left, snapshot of insulated boundaries overlaid with insulation score, AluS 
density and housekeeping genes. Grey box highlights the 200-kb collapsed 
boundary. Right, expression of AluS repeats at this boundary. k, Expression 
z-score of AluS repeats locating at human 2-cell gained boundaries during 
human embryonic development (repeat expression data from ref. ”!; n=2 for 
each stage; for repeat expression calculation, see Methods). Average gene 
expression of AluS repeats within a boundary was used. The number of 2-cell 
gained boundaries with AluS repeats is shown (n= 442). 
Extended Data Fig. 8 | TAD establishment depends on ZGA in human
embryos. a, ZGA gene expression in human embryos and α-amanitin-treated
8-cell embryos (pooled replicates; for previously published data: oocyte, n = 3;
zygote, n = 3; 2-cell, n = 6; 4-cell, n = 12; 8-cell, n = 20; morula, n = 16;
blastocyst, n = 30; for data from this study: 8-cell, n = 2; α-amanitin-treated
8-cell, n = 2). ZGA genes were identified as in ref. 22 (Methods). P values are
calculated by the one-sided Wilcoxon rank-sum test. Boxes represent the 25th,
50th and 75th percentiles and whiskers show 1.5× the interquartile range.
b, Gene expression for human ZGA genes DUXA and ZSCAN5B in control 8-cell and
α-amanitin-treated 8-cell embryos. c, The Spearman correlation for two
α-amanitin-treated 8-cell Hi-C replicates. d, Interaction heat maps of human
2-cell, 8-cell, unmixed α-amanitin-treated 8-cell embryos and α-amanitin
8-cell embryos from the mixed sample (Methods).
Extended Data Fig. 9 | CTCF regulates TAD establishment in human
embryos. a, Gene expression for different cohesin complex subunits during
human embryonic development. Expression data are from a previously published
dataset: oocyte, n = 3; zygote, n = 2; 2-cell, n = 3; 4-cell, n = 4; 8-cell,
n = 11; morula, n = 3. Data are mean ± s.e.m. b, Dynamics of CTCF expression
during human embryonic development. Expression data are from previously
published datasets. Data are mean ± s.e.m. c, CTCF
gene expression in control 8-cell and α-amanitin-treated 8-cell embryos.
Extended Data Fig. 10 | The change of FIRE score and gene expression in
siCTCF morula. a, Snapshot of FIREs in human blastocysts, 6-week embryos
and H1 human ES cells overlaid with PC1 tracks and FIRE score tracks. One
zoomed-in region in human blastocysts is also shown. b, Snapshot of FIREs in
human morula and blastocysts overlaid with DHSs and A/B compartments.
c, Bar plot showing FIREs enrichment and depletion in A/B compartments
(replicates pooled; blastocyst, n = 3; 6-week, n = 2; H1 human ES cell, n = 2).
P values are also shown (χ² test). d, Box plot showing ATAC-seq read signal at
FIREs, non-FIREs and whole genome in human blastocysts. Boxes represent the
25th, 50th and 75th percentiles and whiskers show 1.5× the interquartile range.
P values are also shown (two-sided Wilcoxon rank-sum test). e, Box plot
showing DNase-seq read signal at FIREs, non-FIREs and whole genome in
human blastocysts. Boxes represent the 25th, 50th and 75th percentiles and
whiskers show 1.5× the interquartile range. P values are also shown (two-sided
Wilcoxon rank-sum test). f, Histograms of FIRE score difference between
siCTCF and untreated control morula (siCTCF − untreated) in FIREs and
non-FIREs for siCTCF morula no. 1, siCTCF morula no. 2 and siCTCF morula no. 3.
P values for differential FIRE score (siCTCF − untreated) in FIREs and non-FIREs
are also shown (one-sided t-test). g, The hierarchical cluster of gene expression
in control morula and siCTCF morula analysed using the package ggdendro.
h, Scatter plot of gene expression between control morula and siCTCF morula.
Red dots refer to upregulated genes (254 genes) in siCTCF morula. Blue dots
refer to downregulated genes (565 genes) in siCTCF morula. i, Heat map for
human ES cell CTCF ChIP-seq signal around gene TSSs downregulated in
siCTCF morula. Strongly downregulated genes (log2(fold change(siCTCF/
control)) < −5).
Extended Data Fig. 11 | TADs cannot re-establish in 8-cell (α-amanitin plus
CTCF mRNA) embryos. a, Immunofluorescence confocal images of CTCF in
α-amanitin-treated 8-cell embryos (n = 2) and 8-cell (α-amanitin plus CTCF
mRNA) embryos (n = 3). Scale bar, 40 μm. b, Track snapshot for TAD structures
in 8-cell (α-amanitin plus CTCF mRNA) embryos along with untreated 2-cell,
8-cell and morula embryos. c, d, Interaction heat map examples at 50-kb

mix and mouse morula from the α-amanitin plus CTCF mRNA mixed sample
(Methods). f, Enrichment heat maps for the insulation score of human 2-cell
embryos, 8-cell embryos, α-amanitin 8-cell embryos, 8-cell (α-amanitin plus
CTCF mRNA) embryos and morula embryos around boundaries (±600 kb).
b, boundary centre. Left, boundaries gained at 2-cell stage; right, boundaries
gained at 8-cell stage. g, Bar plots for TAD structure strength of human
used for a Hi-C assay, and 1 embryo can be used to prepare an RNA-seq library. In our analysis, our ultra-low-input Hi-C using 50-100 cells and
were consistent for data results.

Antibodies 


Antibodies used Antibodies used in western blotting: 
Anti-CTCF antibody, abcam, cat#: ab188408, lot#GR3253930-3, 1:1000 dilution 
Anti-RAD21 antibody, abcam, cat#: ab992, lot#GR239192-1, 1:1000 dilution 
Goat Anti-Rabbit IgG, CWBIO, Cat no: CW0103, lot#013341/50402, 1:3000 dilution 
Immunostaining Antibodies: 
Anti-CTCF antibody, abcam, cat#: ab188408, lot#GR3253930-3, 1:500 dilution 
Alexa Fluor 488-labeled goat anti-rabbit IgG, Beyotime, cat#: P0176, 1:1000 dilution 


Validation All the antibodies used in this study were commercial antibodies and were only used for applications, with validation procedures
described on the following sites of the manufacturers: https://www.abcam; https://www.beyotime.com; https://www.cwbiotech.com.


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) HEK 293 cells (ATCC: ATCC® CRL-1573™); HeLa (ATCC: ATCC® CCL-2™); 4T1 (ATCC: ATCC® CRL-2539™); HT22 (EMD Millipore,
SCC129)

Authentication All these cell lines were used in several papers published by our lab; no further authentication was performed for these cell
lines.

Mycoplasma contamination All cell lines tested negative for mycoplasma contamination.


Commonly misidentified lines No commonly misidentified cell lines were used.
(See ICLAC register)
Animals and other organisms 
Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 

Laboratory animals Mouse sperm cells were collected from the cauda epididymis of 8-week-old C57BL/6J male mice (Vital River). The mouse morula
embryos were from the cross of C57BL/6J females (Vital River; 4-6 weeks) and PWK/PhJ males (Jackson Laboratory; 10 weeks on
average).
Oxygenic photosynthesis supplies organic carbon to the modern biosphere, but it is
uncertain when this metabolism originated. It has previously been proposed
that photosynthetic reaction centres capable of splitting water arose by about
3 billion years ago on the basis of the inferred presence of manganese oxides in
Archaean sedimentary rocks. However, this assumes that manganese oxides can be
produced only in the presence of molecular oxygen, reactive oxygen species or by
high-potential photosynthetic reaction centres. Here we show that communities of
anoxygenic photosynthetic microorganisms biomineralize manganese oxides in the
absence of molecular oxygen and high-potential photosynthetic reaction centres.
Microbial oxidation of Mn(II) under strictly anaerobic conditions during the Archaean
eon would have produced geochemical signals identical to those used to date the
evolution of oxygenic photosynthesis before the Great Oxidation Event. This
light-dependent process may also produce manganese oxides in the photic zones
of modern anoxic water bodies and sediments.


Manganese and more than 30 of its described oxides and hydroxides 
mediate the cycling of various trace metals and nutrients in the environ- 
ment. The ability of microorganisms to oxidize Mn(II) anaerobically 
has also been hypothesized to have been a critical step in the evolu- 
tion of oxygenic photosynthesis on the early Earth®. However, modern 
microorganisms are not known to anaerobically oxidize manganese. 
Here we demonstrate this activity in active microbial cultures that grow 
in the presence of nanomolar concentrations of oxygen, relevant for 
the Archaean Earth. 

Inoculum for the enrichment cultures of strictly anaerobic, photosynthetic
biofilms came from the meromictic Fayetteville Green Lake
(New York, USA). The most abundant phototroph in the sulfidic photic
zone of this lake is the green sulfur bacterium Chlorobium sp., a
microorganism that uses sulfide as the electron donor for photosynthesis.
Biofilms (Fig. 1a) containing Chlorobium sp. and other strict anaerobic
microorganisms were enriched in a minimal medium amended with
20-50 μM Na2S and 1 mM MnCl2 and equilibrated with an anaerobic
atmosphere of 80% N2 and 20% CO2 at pH 7. The concentration of O2 in
the medium was lower than 2 nM during the entire experiment, both in
the presence and the absence of cells (Extended Data Fig. 1, Methods).
These experimental concentrations match the upper estimates for the
Archaean Earth. The anaerobic medium also lacked other potential
oxidants for Mn(II), such as nitrite, nitrate and H2O2, and these species
were not produced in sterile controls (‘Concentrations of dissolved
species in culture media’ in the Methods).

Four times more biomass grew in photosynthesizing cultures relative
to the cultures incubated in the dark (P << 0.001) (Fig. 1b). The
enrichment protocol yielded a stable community that contained
Chlorobium, Paludibacter, Acholeplasma, Geobacter, Desulfomicrobium,
Clostridium, Acetobacterium and several other bacteria (Fig. 1c). The
Chlorobium sp. was the most abundant taxon across all conditions, as
well as the only identifiable phototroph (Fig. 1c). Its genome was 98.4%
similar to Chlorobium limicola Frassasi.

This microbial community was essential for the precipitation of
minerals and oxidation of manganese. Manganese-rich dolomite was
the most abundant precipitate in photosynthetic cultures amended
with 0.1-1 mM Mn(II) and 20-250 μM sulfide (Fig. 2, Extended Data
Fig. 2a, b, d), which is comparable to the estimated dissolved Mn(II)
concentrations (0.1 mM) maintained by the precipitation of
manganese-containing carbonate minerals on Archaean carbonate platforms.
Cultures incubated in this range of chemical conditions also contained
manganese oxide minerals (Figs. 1d, 2), but cultures incubated with
approximately 1 μM Mn(II) or 1 mM sulfide did not (Extended Data
Fig. 2c). Precipitates were entirely absent from sterile controls
incubated in the light, whereas minor calcite, less dolomite and no
manganese oxides were detected in cultures shielded from the light (Fig. 2).
Elemental sulfur (S⁰) accumulated in cultures that did not contain
oxidized manganese minerals (Extended Data Fig. 3c), as expected
during the growth of Chlorobium sp. under conditions in which sulfide
was the main photosynthetic electron donor. Abiotic reactions did not
contribute detectable amounts of manganese oxides or other minerals
under our experimental conditions. Instead, the microbial presence
and photosynthetic activity strongly controlled the nucleation and
precipitation of minerals, including manganese oxides.

To characterize the redox cycling of manganese, we examined its oxi- 
dation state by surface-sensitive methods (Extended Data Fig. 3). After 
two weeks, sterile controls and cultures that were incubated in the dark 
contained only Mn(II) (Extended Data Fig. 3a). Dolomite that formed
in photosynthetic cultures contained Mn(II), but we also detected
Mn(II), Mn(III) and Mn(IV) in calcium manganese oxides, Mn₃O₄ and
other minerals (Figs. 1d, 2, 3, Extended Data Fig. 3). The presence of
manganese oxides in sulfidic photosynthetic cultures was surprising,
but they were repeatedly found in biofilms that were between two weeks
and two months old (Fig. 2). A colorimetric assay quantified 5.1 ± 0.8 µM
of oxidized manganese in one-week-old biofilms, 7.2 ± 0.8 µM oxidizing
equivalents in two-week-old biofilms and >10 µM in three-week-old
biofilms (‘Quantification of biofilms and oxidized manganese in
biofilms’ in the Methods). All oxidized manganese was determined as
KMnO₄ equivalents and none was detected in dark controls.

Oxidized manganese was present in minerals that were only found
on cell surfaces (Fig. 3a) or around extracellular vesicles. These cells
could be identified as Chlorobium sp. on the basis of the presence
of large intracellular complexes of photosynthetic antennae called
chlorosomes (Fig. 3b) and surface protrusions called spinae. High-
resolution transmission electron micrographs of fresh cell suspensions
showed manganese-calcium minerals with a uniform lattice fringe that
corresponded to the (121) plane with interplanar spacing of 2.64 Å of
calcium manganese oxide (Fig. 3d), Mn₃O₄ (Fig. 3f) and other manganese
minerals (Extended Data Fig. 2). Extracellular vesicles, spinae and
manganese oxide minerals were absent from Chlorobium sp. when the
cultures were incubated with Mn(II) in the dark or photosynthetically
with 1 mM sulfide.

Fig. 1 | Biofilms incubated for two weeks. a, Dark brown biofilm incubated in the
light with Mn(II) (left), less-extensive biofilm incubated in the light without
Mn(II) (middle) and a yellow biofilm incubated in the dark with Mn(II) (right).
b, Biomass of biofilms measured by the crystal violet assay. All incubations were
performed in triplicate. AU, arbitrary units. n = 3 measurements; error
bars ± 0.3 s.d. c, Microbial diversity in biofilms obtained by high-throughput
Illumina sequencing of 16S rRNA genes. The most abundant taxon across all
conditions was a Chlorobium sp. (30-60%). This microorganism was more
abundant in photosynthesizing cultures. B, Bacteroidetes; C, Chlorobi; Cf,
Chloroflexi; F, Firmicutes; P, Proteobacteria; T, Tenericutes. d, Diffraction pattern
indices for Mn₃O₄ in the spectrum acquired by synchrotron micro-focused X-ray
diffraction (µXRD) of a biofilm incubated in the light for two weeks.

Fig. 2 | XRD spectra of biofilms incubated for two weeks. Top spectrum,
biofilms incubated in the light with 1 mM Mn(II). Black lines indicate Miller
indices (hkl) assigned to each peak of CaMnO₃. Bottom spectrum, biofilms
incubated in the dark with 1 mM Mn(II). The sulfur phases (S) formed during the
treatment to remove oxidized manganese minerals before inoculation
(‘Modified FGL medium’ and ‘Interpretation of XRD peaks’ in the Methods). All
data are representative of two independent experiments.

Fig. 3 | Interactions between minerals and microorganisms in biofilms
incubated with Mn(II) in the light for two weeks. a, Mineral-encrusted cells
(white arrows) in fixed and stained samples (transmission electron microscopy
(TEM) at 80 kV); the white square indicates the area magnified in b. Scale bar,
200 nm. b, Chlorosomes (white arrows) in a cell encrusted by manganese oxide
precipitates (TEM at 80 kV, fixed and stained sample). Scale bar, 50 nm. c, TEM
at 200 kV of unprocessed and unstained mineral-encrusted microbial cells.
White square indicates the region selected for selected area electron
diffraction (SAED), shown in d. Scale bar, 200 nm. d, High-resolution TEM and
SAED of minerals around an unstained cell from a fresh suspension of the
microbial culture on a TEM grid show CaMnO₃ with a d-spacing of 2.64 Å. Scale
bars, 5 nm (d), 5 1/nm (inset). e, TEM at 200 kV of unprocessed and unstained
encrusted microbial cells. White square indicates the region selected for SAED
shown in f. Scale bar, 200 nm. f, High-resolution TEM and SAED of minerals in
the area outlined by the white square in e show Mn₃O₄ with a d-spacing of
3.40 Å. Scale bars, 5 nm (f), 10 1/nm (inset). All data are representative of two
independent experiments.

Light-driven manganese oxidation occurred only when Chlorobium 
sp. and other microorganisms, including Geobacter sp., were growing 
together. Oxidized manganese was present in enrichment cultures of 
microorganisms from Fayetteville Green Lake that contained Chlorobium
sp., Geobacter sp., Acholeplasma equifetale, Alistipes sp. HGB5 and
Caldicoprobacter oshimai, but absent from co-cultures of Chlorobium
sp. and Desulfomicrobium sp. (Extended Data Fig. 4) and pure cultures
of C. limicola. Manganese oxidizing activity was also detected in the
co-cultures containing C. limicola, Chlorobaculum tepidum and Geobacter
lovleyi (DSM 245, DSM 12025 and DSM 17278, DSMZ) (‘Probing
the redox state of manganese in co-cultures’ and ‘Interpretation of XPS
spectra’ in the Methods) and the co-cultures of C. limicola and G. lovleyi
(Extended Data Fig. 5). This suggests that the activity may depend on
extracellular electron transfer between the latter two organisms by
a currently unknown mechanism. The only photosynthetic reaction
centre that the genome of Chlorobium sp. encodes is a well-studied
centre with a midpoint potential around +250 mV, which cannot
directly oxidize Mn(II) bicarbonate (E₀ = 520-670 mV). Chlorobium
sp. also lacks clear homologues of proteins that are known to oxidize
manganese under aerobic conditions. Manganese oxidation in
C. limicola may occur by an endergonic mechanism, analogous to that
proposed for the oxidation of nitrite by the Thiocapsa sp. strain KS1.
The electron transfer between C. limicola and G. lovleyi may involve
high-potential cytochrome c in Chlorobium sp. and OmpB operating
in reverse in Geobacter sp., but the Mn(II) oxidation mechanisms and
potential oxidants in the co-cultures of C. limicola and G. lovleyi remain
to be characterized.

The abundance of oxidized manganese in microbial biofilms, and its 
absence from the sterile controls, shows that microbial consortia can 
mediate the precipitation of manganese oxide minerals under condi- 
tions similar to those in the Archaean eon. These findings expand the 
diversity of minerals and redox processes beyond what was thought 
possible in strictly anaerobic environments or in the presence of high-
potential photosynthetic reaction centres. Microbial interactions that
mediate the light-dependent redox cycling of manganese and couple it
to other elemental cycles remain to be identified, but can be expected in
modern environments in which light, sulfide and dissolved Mn(II) coexist
and sulfide concentrations do not exceed about 0.2 mM (Extended
Data Table 1). The light-dependent microbial production of manganese
oxides is likely to stimulate the redox cycles of carbon, sulfur, nitrogen
and iron, increase the diversity of anaerobic redox transformations
(including nitrification-denitrification) and influence interpretations
of isotopic and chemical signatures of these processes in modern
anaerobic settings. 

The biological production of manganese oxides under Archaean-like 
chemical conditions has additional major implications for determining
the timing of the origin of oxygenic photosynthesis, which is currently
debated. The evolution of oxygenic photosynthesis and the
presence of locally oxic areas in Archaean marine systems prior to the
Great Oxidation Event (GOE) are inferred from geochemical signals.
However, interpretations of these signals assume the former presence
of manganese oxides. For example, manganese carbonate
deposits with the negative δ¹³C values reported in the Neoarchaean
Sandur schist belt (India) or Mesoarchaean Witwatersrand-Mozaan
strata (South Africa) are thought to have been produced by the
microbial reduction of Mn(III) and Mn(IV) oxide minerals. In turn, these
oxides are attributed to the aerobic oxidation of Mn(II) in the presence
of oxygen. Molecular clock models are used to independently date
the evolution of the crown group cyanobacteria and have produced
estimates that range from before 3 billion years ago to after the GOE,
depending on the sequence datasets used, prior assumptions made
and specific model calibrations. These models support the radiation
of anoxygenic green sulfur bacteria (such as Chlorobium) and green
non-sulfur bacteria after the GOE, but also show that stem-group
green sulfur bacteria diverged as early as 2.9 billion years ago. Given
that the last common ancestor of modern green sulfur bacteria was
photosynthetic, model estimates are consistent with anoxygenic
photosynthesis within stem-group green sulfur bacteria long before the GOE
(Extended Data Fig. 6). Therefore, anaerobic manganese oxidation that
requires anoxygenic photosynthetic activity in the presence of sulfide
could be as old as the anoxygenic phototrophic ancestors of any extant
groups of phototrophs (including green sulfur bacteria, cyanobacteria
or even an extinct lineage of anoxygenic phototrophs). Because any
of these scenarios can pre-date the GOE, the relative contributions of
anaerobic and oxygen-dependent Mn(II) oxidation to the redox texture
of sedimentary rocks much before the GOE are an open question,
and the loss of mass-independent sulfur isotope signals at the GOE
remains the firmest evidence for the biological production of oxygen.
Methods


Culturing and sequencing 

Enrichment and culturing conditions. Sediments were retrieved from 
Fayetteville Green Lake (FGL) by a metallic gravity scoop and stored at 
4 °C in fully filled, hermetically sealed glass jars. These samples were
used as inoculum for enrichment cultures in the FGL medium. All inoculations
were conducted in an anaerobic chamber under a 5% CO₂/5% H₂/
balance N₂ (v/v/v) atmosphere using standard anaerobic techniques. In
brief, FGL medium was flushed before and after autoclaving in hermetically
sealed glass bottles. Sterile flushed FGL medium was inoculated
inside the anaerobic chamber (5% CO₂/5% H₂/balance N₂ (v/v/v) atmosphere).
To avoid any issues associated with the exposure of biofilms to H₂, the
serum bottles were opened inside the anaerobic chamber, inoculated
in about 1 min, closed immediately, capped and flushed again with
CO₂/N₂ for 60-75 min to remove H₂. All experiments were conducted in
batch cultures and all cultures were inoculated with approximately 1 mg
of biofilm that had been washed 6 times with anoxic nanopure water,
mechanically dispersed by passing through a syringe and resuspended
in sterile anaerobic medium. All these steps were carried out in the
anaerobic glove box. All enrichment cultures were incubated at 27 °C
with an incandescent white light bulb and a 12:12-h day:night cycle. All
plasticware used in the anaerobic chamber was introduced into the 
chamber at least one week before the experiments. 

The culture medium (FGL medium) contained 0.1 mM KH₂PO₄, 5.61
mM NH₄Cl, 0.9 mM KCl, 0.024 M NaHCO₃, 1 mM MnCl₂·2H₂O, 1 mM
Na₂SO₄ and 1 ml/l of trace element solution. The trace element solution
was prepared in 10% (v/v) HCl and contained, per litre, 1.5 g FeCl₂·4H₂O,
190 mg CoCl₂·6H₂O, 100 mg MnCl₂·4H₂O, 70 mg ZnCl₂, 31 mg Na₂MoO₄,
6 mg H₃BO₃ and 2 mg CuCl₂·2H₂O. The pH of the medium was adjusted to 7
by the addition of NaOH (1 M) or HCl (1 M). After adjusting the pH to 7,
the FGL medium was distributed into glass bottles of different volumes
(12, 25, 50, 150 and 200 ml). The final background concentration of
manganese was 0.4 µM. To inhibit the growth of oxygenic phototrophs,
we added 0.01 mM 3-(3,4-dichlorophenyl)-1,1-dimethylurea (DCMU) to
the initial enrichments. All analyses described in the main text used later
enrichments that were grown in DCMU-less medium. Glass serum bottles
were capped with butyl rubber stoppers and aluminium seals. Before
autoclaving, the FGL medium in bottles was flushed with 20% CO₂:80% N₂
for 1 h and the serum bottle headspaces for another 40 min. The bottles
were then autoclaved (40-min sterile). After autoclaving and cooling, the FGL
medium in bottles was flushed again with 20% CO₂:80% N₂ for 1 h and
the serum bottle headspaces for another 40 min.
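
As a worked illustration of the recipe above, the following minimal sketch converts the stated molar concentrations of the anhydrous salts into masses per litre. The atomic weights are standard rounded values; the dictionary layout and the function names `molar_mass` and `grams_per_litre` are illustrative assumptions and are not part of the original protocol.

```python
# Standard atomic weights in g/mol (rounded values).
ATOMIC_WEIGHT = {
    "H": 1.008, "C": 12.011, "N": 14.007, "O": 15.999, "Na": 22.990,
    "P": 30.974, "S": 32.06, "Cl": 35.45, "K": 39.098,
}

# Anhydrous salts of the basal FGL medium: (element counts, mol per litre).
# Concentrations come from the recipe in the text; the layout is illustrative.
FGL_SALTS = {
    "KH2PO4": ({"K": 1, "H": 2, "P": 1, "O": 4}, 0.1e-3),
    "NH4Cl": ({"N": 1, "H": 4, "Cl": 1}, 5.61e-3),
    "KCl": ({"K": 1, "Cl": 1}, 0.9e-3),
    "NaHCO3": ({"Na": 1, "H": 1, "C": 1, "O": 3}, 0.024),
    "Na2SO4": ({"Na": 2, "S": 1, "O": 4}, 1.0e-3),
}


def molar_mass(composition):
    """Sum atomic weights weighted by the number of atoms of each element."""
    return sum(ATOMIC_WEIGHT[el] * n for el, n in composition.items())


def grams_per_litre(salt):
    """Mass of salt (g) to weigh per litre for the tabulated molarity."""
    composition, molarity = FGL_SALTS[salt]
    return molar_mass(composition) * molarity


if __name__ == "__main__":
    for salt in FGL_SALTS:
        print(f"{salt}: {grams_per_litre(salt) * 1000:.1f} mg/l")
```

Hydrated salts (for example the manganese chloride hydrates) would need the water of crystallization added to the element counts before weighing.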

A separately prepared selenium stock solution contained 2 mg of
Na₂SeO₃ in 1,000 ml of 0.01 M NaOH. This solution was autoclaved and
made anaerobic by flushing the bottles for 1 h and 40 min with 20%
CO₂:80% N₂. The vitamin solution was prepared in nanopure water by
aerobic filter sterilization and contained, per litre, 2 mg biotin, 2 mg
folic acid, 10 mg pyridoxine·2H₂O, 5 mg thiamine-HCl·2H₂O, 5 mg
riboflavin, 5 mg nicotinic acid, 5 mg D-Ca-pantothenate, 0.1 mg vitamin
B12, 5 mg p-aminobenzoic acid and 5 mg lipoic acid.

The master stock solution (20x) contained 1.5 mM MgCl₂·6H₂O, 1 mM
CaCl₂·2H₂O, 1 ml of vitamin solution and 1 ml of the selenium stock solution
in 50 ml nanopure water. The master solution was filter-sterilized
and flushed for 1 h and 40 min with a 20% CO₂:80% N₂ gas mixture. This
solution was added to the FGL medium immediately before inoculation,
5 ml per 100 ml medium. Manganese was added at the time of inoculation
from a concentrated anaerobic stock solution of MnCl₂·4H₂O (1 M).
Finally, after inoculation, the medium was reduced by the addition of
sulfide from a concentrated anaerobic stock solution of Na₂S·5H₂O
(0.2 M).
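
The additions from concentrated stocks described above follow a simple mass balance. The sketch below, with the hypothetical helper `stock_volume_ml`, estimates how much stock to inject into a bottle of medium; the 50-ml bottle volume and the target concentrations in the example are taken from values that appear elsewhere in the Methods, and combining them here is an assumption made only for illustration.

```python
def stock_volume_ml(target_molar, stock_molar, medium_ml):
    """Volume of stock (ml) that brings the final mixture to target_molar.

    Mass balance: stock_molar * v = target_molar * (medium_ml + v),
    so v = target_molar * medium_ml / (stock_molar - target_molar).
    """
    if not 0 < target_molar < stock_molar:
        raise ValueError("target must be positive and below the stock concentration")
    return target_molar * medium_ml / (stock_molar - target_molar)


if __name__ == "__main__":
    # 1 mM Mn(II) from the 1 M MnCl2 stock in a 50-ml bottle (illustrative).
    mn_ul = stock_volume_ml(1e-3, 1.0, 50.0) * 1000
    # 50 uM sulfide from the 0.2 M Na2S stock in the same bottle (illustrative).
    s_ul = stock_volume_ml(50e-6, 0.2, 50.0) * 1000
    print(f"MnCl2 stock: {mn_ul:.1f} ul, Na2S stock: {s_ul:.1f} ul")
```

Because the stocks are far more concentrated than the targets, the correction for the added volume is well below 1% and the simpler ratio target/stock times the medium volume gives nearly the same answer.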

The medium used for the initial enrichment was reduced by the 
addition of 4 mM sodium ascorbate instead of sulfide, to minimize 
the growth of organisms that require high concentrations of sulfide 
as an electron donor. A brown microbial mat formed on the surface
of the inoculated sediments after 3-4 weeks. Fragments of this mat
were transferred into the sterile medium with the same composi- 
tion as described above, incubated in the same conditions for one 
month and transferred again. Most experiments described here used 
biofilms that had undergone at least four transfers from the initial 
enrichment. 


Modified FGL medium. All experiments described in the main text 
used the modified basal FGL (MFGL) medium. This medium did not 
contain DCMU, sulfate or ascorbate, and was reduced by 20-50 µM
Na₂S. Sterile MFGL contained only traces of sulfate (<0.9 µM), nitrate
(<0.5 µM) and nitrite (<0.1 µM), as detected by ion chromatography
(see ‘Concentrations of dissolved species in culture media’). 

To evaluate the influence of light on growth, biofilms from the third 
transfer were inoculated into the MFGL medium. One triplicate set of 
batch cultures was incubated for 2 weeks in the light at 27 °C at a distance
of 35 cm from an incandescent white light bulb that emits between
400 and 700 nm. Another set was incubated at the same time and at the
same temperature but was shielded from the light by aluminium foil. 

To reduce the carryover of manganese oxides in the biofilm inoculum,
we chemically reduced the inocula using a previously described protocol.
In brief, microbial biofilms that had been grown in the presence
of light and 1 mM Mn(II) were collected and incubated in anaerobic
sterile ascorbic acid (0.25 mM) in the anaerobic chamber for 10 min. 
After this incubation, the biofilms were washed three times with ster- 
ile, anaerobic nanopure water. XRD analyses of biofilms treated in 
this manner showed that this protocol removed manganese oxide 
minerals but increased the abundance of elemental sulfur in the 
inoculum. Elemental sulfur was absent from the biofilms before the 
treatment. 

To characterize the effect of different initial concentrations of 
manganese and sulfide on mineral precipitation, biofilms from the 
third transfer of the original enrichment culture were inoculated into 
the MFGL medium amended with MnCl₂ and Na₂S from 0.5 M and 0.1 M
anaerobic stock solutions. All stock solutions were prepared, autoclaved
and stored under an atmosphere of N₂. The effect of manganese
concentration on manganese oxidation was evaluated in three sets
of triplicate inoculated cultures that contained 0.1, 1 or 5 mM MnCl₂.
All these cultures were reduced with 50 µM Na₂S. The effect of H₂S
concentration on manganese oxidation was explored in three sets of
triplicate cultures reduced by 0.05, 0.25 or 1 mM of Na₂S, all amended
with 1 mM MnCl₂. An additional set of triplicate cultures contained 1 mM
Mn(II) and 0.02 mM Na₂S. All cultures were incubated for two weeks.
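
The treatment matrix described in this paragraph can be written out explicitly. The following sketch is only a bookkeeping aid under the assumption that each named set is an independent triplicate; the field names and the function `build_design` are hypothetical.

```python
REPLICATES = 3  # every set in the text is run in triplicate

# Manganese series: 0.1, 1 or 5 mM MnCl2, all reduced with 50 uM (0.05 mM) Na2S.
manganese_series = [{"MnCl2_mM": mn, "Na2S_mM": 0.05} for mn in (0.1, 1.0, 5.0)]
# Sulfide series: 0.05, 0.25 or 1 mM Na2S, all amended with 1 mM MnCl2.
sulfide_series = [{"MnCl2_mM": 1.0, "Na2S_mM": s} for s in (0.05, 0.25, 1.0)]
# Additional set: 1 mM Mn(II) and 0.02 mM Na2S.
extra_set = [{"MnCl2_mM": 1.0, "Na2S_mM": 0.02}]


def build_design():
    """Return one record per culture bottle: condition plus replicate number."""
    design = []
    for condition in manganese_series + sulfide_series + extra_set:
        for rep in range(1, REPLICATES + 1):
            design.append({**condition, "replicate": rep})
    return design


if __name__ == "__main__":
    design = build_design()
    unique = {(d["MnCl2_mM"], d["Na2S_mM"]) for d in design}
    print(len(design), "bottles,", len(unique), "distinct conditions")
```

Note that the 1 mM MnCl₂ with 0.05 mM Na₂S condition appears in both series, so seven triplicate sets cover six distinct chemical conditions.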


Further enrichment of manganese-oxidizing and sulfide-oxidizing 
microorganisms. Microbial communities capable of anaerobic oxida- 
tion of manganese were further enriched by inoculating anaerobically 
sealed agar shake tubes with the dispersed biofilms, serially diluting 
the cultures in agar, transferring colonies into liquid medium and
repeating the entire process a second time. The MFGL medium in
agar shake tubes was solidified by 1.1% agar. Biofilms were washed with
anaerobic nanopure water in the anaerobic glove box and mechanically
dispersed in 10 ml of the basal MFGL medium. The first agar shake tube
was inoculated with 10% (1 ml) of the dispersed inoculum and diluted 
by 5 successive transfers of 1 ml into 9 ml of sterile MFGL. 
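
The agar shake dilution series can be followed with exact fractions. This sketch assumes that the first tube represents a tenfold dilution of the dispersed biofilm (1 ml of inoculum in a 10-ml tube), which the text implies but does not state outright; the function name `dilution_series` is hypothetical.

```python
from fractions import Fraction


def dilution_series(n_transfers=5, transfer_ml=1, diluent_ml=9,
                    first_tube=Fraction(1, 10)):
    """Cumulative dilution of the dispersed inoculum in each shake tube.

    first_tube is the assumed dilution in the first tube; every later tube
    receives transfer_ml from the previous tube into diluent_ml of medium.
    """
    step = Fraction(transfer_ml, transfer_ml + diluent_ml)
    series = [first_tube]
    for _ in range(n_transfers):
        series.append(series[-1] * step)
    return series


if __name__ == "__main__":
    for tube, fraction in enumerate(dilution_series(), start=1):
        print(f"tube {tube}: 1/{fraction.denominator}")
```

Under these assumptions the sixth tube holds a millionfold dilution of the original suspension, which is the range in which well-separated colonies would be expected.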

The additions of sulfide and manganese to the basal MFGL in agar 
shake tubes targeted two different conditions: condition (1), 0.02 mM
Na₂S and 1 mM MnCl₂, sought to enrich for microorganisms that can
photosynthesize in the presence of low sulfide concentrations and
oxidize Mn(II); and condition (2), 1 mM Na₂S (MnCl₂ added only in the
trace metal solution), enriched for Chlorobium spp. that can oxidize
sulfide. Extended Data Table 2 summarizes the enrichment protocol
and conditions. The shake tubes were incubated at 27 °C at a distance
of 35 cm from the incandescent white light bulb. Colonies that formed
after one month were transferred from the solid medium into the liquid
medium that contained the same concentrations of MnCl₂ and Na₂S.
Biofilms that grew in liquid after one month were mechanically dispersed
by a syringe, inoculated into another set of agar shake tubes
and incubated for one month. Colonies from the shake tubes were
inoculated again into the liquid medium. The purity of the cultures at
each transfer was tested by Sanger sequencing. Amplified 16S rRNA
genes were sequenced in both directions using either 27F
(5′-AGAGTTTGATCCTGGCTCAG-3′) or 1492R (5′-ACGGCTACCTTGTTACGACTT-3′)
(Integrated DNA Technologies), assembled to get a nearly full-length
16S rRNA gene (GeneWiz) and identified using nucleotide BLAST on
GenBank. Future experiments should also explore the possibility
of light-dependent production of organoperoxides in the medium as 
a function of vitamins and Fe(II) in the medium. 


DNA extraction, 16S rRNA gene Illumina sequencing and 
phylogenetic analyses. A 500-µl sample of each biofilm enrichment
(including early enrichments and the lake inoculum) was collected
and spun down into a pellet. Total DNA was extracted from samples
using the PowerSoil DNA Isolation Kit (MoBio) according to the manufacturer’s
instructions and eluted in 60 µl C6 solution. After extraction,
DNA was quantified using NanoDrop (Thermo Scientific). The
extracted samples and blank-template controls from the PowerSoil
DNA Isolation kit were stored at −80 °C and sent to Argonne National
Laboratory on dry ice for sequencing. The community composition
was characterized using 16S rRNA gene amplicon paired-end sequencing
on the MiSeq Illumina platform. In brief, the V4 region of the 16S rRNA
gene (515F-806R) from each sample was amplified using the bacterial-
specific primers 515F (5′-GTGCCAGCMGCCGCGGTAA-3′) and 806R
(5′-GGACTACHVGGGTWTCTAAT-3′). After the amplifications, the
PCR amplicons were quantified using the Quant-iT PicoGreen dsDNA Assay
Kit (ThermoFisher/Invitrogen, P11496) according to the manufacturer’s
instructions and pooled in equal concentrations (240 ng) into a single
tube. This pool was cleaned up using the MoBio UltraClean PCR Clean-Up
Kit (MoBio) and quantified using the Qubit (Invitrogen). The pooled
samples were sequenced on the Illumina MiSeq platform (Illumina).
All library preparations, pooling, quality controls and sequencing runs
were performed at the Argonne National Laboratory. Sequence data 
were analysed using QIIME v.1.9.0. Paired-end reads were joined using
the fastq-join method, and libraries were demultiplexed and filtered.
Any reads that did not assemble by perfect matches in the overlapping
region or meet the q-score (>20) threshold were removed and were not
used in subsequent analyses. Chimeric sequences were identified using
UCHIME’s usearch61 de novo-based chimaera detection algorithm and
removed from the quality-filtered sequences. Filtered and chimaera-
free sequences were aligned and clustered into operational taxonomic
units at >97% similarity level using the closed-reference UCLUST algorithm
against the Greengenes v.13.8 reference dataset as a database. The
most abundant sequence from each cluster was selected as a representative
sequence. All representative sequences were aligned using
PyNAST. A phylogenetic tree for subsequent phylogenetic analyses was
built using FastTree. Operational taxonomic unit counts were rarefied
to 10,000 sequences per sample for diversity analysis using taxonomic
and phylogenetic indices that included the Shannon index and Faith’s PD
index. To identify bacterial taxa with sequences that are more abundant
in samples grown in light and/or with Mn(II), we used LEfSe, which
performs a nonparametric Wilcoxon rank-sum test followed by linear
discriminant analysis (LDA), coupled with effect size measurements to
assess differentially abundant taxa. Chlorobium sp. sequences were
significantly enriched in samples grown in the presence of light and
Mn(II) with LDA scores >5. Cultures grown in light had significantly
more Chlorobium sp. (analysis of variance (ANOVA), F = 23.4521, d.f.
factor = 5, d.f. error = 12, P < 0.0001; Tukey’s HSD, P < 0.01). Sequence data
are available as FASTQ files at the National Center for Biotechnology 
Information (NCBI) via Sequence Read Archive (SRA), under the SRA 
accession ID number SRP133329. 
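
The rarefaction and Shannon diversity steps mentioned above can be sketched in a few lines. This is not the QIIME implementation: the functions `rarefy` and `shannon` are simplified stand-ins, the logarithm base of 2 is an assumption, and the small count table in the example is invented purely for demonstration.

```python
import math
import random


def rarefy(counts, depth, seed=0):
    """Subsample a {taxon: count} table to `depth` reads without replacement."""
    pool = [taxon for taxon, n in counts.items() for _ in range(n)]
    if len(pool) < depth:
        raise ValueError("sample has fewer reads than the rarefaction depth")
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    rarefied = {}
    for taxon in rng.sample(pool, depth):
        rarefied[taxon] = rarefied.get(taxon, 0) + 1
    return rarefied


def shannon(counts, base=2.0):
    """Shannon index H = -sum(p * log(p)) over taxa with non-zero counts."""
    total = sum(counts.values())
    return -sum((n / total) * math.log(n / total) / math.log(base)
                for n in counts.values() if n > 0)


if __name__ == "__main__":
    # Invented read counts, for illustration only.
    sample = {"Chlorobium": 6000, "Geobacter": 3000, "Acholeplasma": 3000}
    sub = rarefy(sample, depth=10_000)
    print(sum(sub.values()), "reads after rarefaction")
    print(f"Shannon index: {shannon(sub):.3f}")
```

Rarefying every sample to the same depth before computing the index keeps differences in sequencing effort from being mistaken for differences in diversity.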


Metagenome sequencing and analysis. To determine the metabolic 
potential of cultures grown from colonies that targeted specific growth 
conditions (see ‘Further enrichment of manganese-oxidizing and
sulfide-oxidizing microorganisms’), we sequenced their metagenomes.
The DNA of enrichments obtained using condition (1) was extracted
using a modified phenol-chloroform method with ethanol precipitation
as previously described and quantified by a Qubit 2.0 Fluorometer
(Thermo Fisher Scientific). This DNA was sent for metagenomic
sequencing to the University of Southern California’s Genome and
Cytometry Core Facility. The library preparation, quality control and
sequencing were performed at the Cytometry Core Facility. In brief, before
sequencing on the Illumina HiSeq 2500 platform, DNA was sheared
using dsDNA Shearase Plus (Zymo), cleaned up using Agencourt AMPure
XP beads (Beckman-Coulter), the library was quantified using the Qubit 
2.0 Fluorometer and the DNA fragment size was determined with an 
Agilent Bioanalyzer 2100. 

The quality control of the sequence data was performed using 
Trimmomatic v.0.36 using default parameters and a minimum sequence 
length of 36 bp. IDBA-UD v.1.1.2 was used to assemble the reads with
a 2,000 bp minimum contig length. SAMtools v.1.3.1 was used to convert
files to binary format for downstream analysis. VizBin was used
to delineate individual genomes from the enrichment metadata and
the genomes were assigned putative taxonomic identities according
to their placement in a phylogenetic tree in CheckM v.1.0.4 using the
‘tree’ command.
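
The 2,000 bp minimum contig length is applied by the assembler in the workflow above. For readers who want to apply the same threshold to an existing FASTA file, a minimal sketch follows; the helpers `read_fasta` and `filter_contigs` are hypothetical and are not part of IDBA-UD or any tool named in the text.

```python
def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def filter_contigs(records, min_len=2000):
    """Keep contigs whose length is at least min_len bases."""
    return [(name, seq) for name, seq in records if len(seq) >= min_len]


if __name__ == "__main__":
    # Synthetic contigs, for illustration only.
    fasta = [">contig_1", "A" * 1500, ">contig_2", "ACGT" * 300, "ACGT" * 300]
    for name, seq in filter_contigs(read_fasta(fasta)):
        print(name, len(seq))
```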

Individual genomes obtained from the metagenome data were 
submitted to the DOE Joint Genome IMG-MER (Integrated Microbial
Genomes) pipeline for gene calling and assembly. The protein-
coding-gene prediction tool Prodigal v.3.0.0 was used to determine
genes in the enrichment grown from the colony on 1 mM MnCl₂
and 20-50 µM Na₂S. The genome of C. limicola was 98.4% similar to
C. limicola Frassasi.

To detect putative Mn(II)-oxidizing genes in C. limicola, we first 
generated a blast database of protein-coding Mn(II)-oxidizing genes 
by selecting genes that encode for multi-copper oxidases and animal 
haem peroxidases. Because multi-copper oxidases and animal haem 
peroxidases each contain several classes of enzymes and can trans- 
fer electrons from a number of different substrates, we focused on 
enzymes with confirmed manganese-oxidizing activities by biochemi- 
cal and molecular assays. All multi-copper oxidases and animal haem 
peroxidases involved in Mn(II) oxidation and characterized to date are
from aerobic microorganisms and include genes such as mnxG, mcoA
and mopA in Pseudomonas putida, mnxG in the spores of Bacillus strain
SG-1, moxA in Pedomicrobium sp. ACM 3067, and mopA in Aurantimonas
manganoxydans SI85-9A1 and Roseobacter sp. AzwK-3b.
To determine whether Chlorobium has any homologues with characterized
manganese-oxidizing multi-copper oxidases and animal haem
peroxidases, we used BLASTp and queried translated Mn(II)-oxidizing
genes against the Chlorobium genome with an e-value cut-off of 10° 
and a bit score of 30. Homologues of multi-copper oxidases and animal
haem peroxidases in C. limicola are shown in Extended Data Table 4. 
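
A homology screen of this kind is commonly post-processed from BLAST tabular output (the standard 12-column format, in which columns 11 and 12 hold the e-value and the bit score). The sketch below filters such hits; the function names are hypothetical, the e-value cut-off of 1e-5 in the example is a placeholder rather than the value used in the study, and treating the bit score of 30 as a minimum is an assumption.

```python
def parse_hits(lines):
    """Parse BLAST tabular lines (12 standard columns) into dictionaries."""
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        yield {
            "query": fields[0],
            "subject": fields[1],
            "evalue": float(fields[10]),
            "bitscore": float(fields[11]),
        }


def filter_hits(hits, max_evalue, min_bitscore=30.0):
    """Keep hits at or below max_evalue and at or above min_bitscore."""
    return [h for h in hits
            if h["evalue"] <= max_evalue and h["bitscore"] >= min_bitscore]


if __name__ == "__main__":
    # Synthetic rows, for illustration only.
    rows = [
        "\t".join(["mnxG", "gene_0001", "31.2", "250", "160", "5",
                   "10", "255", "20", "268", "2e-20", "95.1"]),
        "\t".join(["mopA", "gene_0417", "24.0", "80", "58", "2",
                   "5", "84", "40", "118", "0.5", "27.3"]),
    ]
    for hit in filter_hits(parse_hits(rows), max_evalue=1e-5):
        print(hit["query"], hit["subject"], hit["evalue"], hit["bitscore"])
```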

Sequence data for C. limicola can be accessed at the JGI-IMG under 
IMG submission ID 124328. 


Spectroscopy 

X-ray photoelectron spectroscopy. X-ray photoelectron spectroscopy
(XPS) was performed on a K-Alpha+ X-ray photoelectron spectrometer
(K-Alpha+ XPS, Thermo Fisher). Biofilms were collected
and centrifuged at 14,000 rpm for 5 min in the anaerobic chamber to
form pellets. The pellets were placed on double-sided carbon tape and
dried in the anaerobic chamber. To maintain the anoxic conditions, the
samples were stored in the anaerobic chamber in hermetically sealed
glass vials before analysis. All samples were fractured in high vacuum
(3 × 10⁻⁸ Torr) in the outer pressure chamber and then moved directly
into the main XPS measurement chamber. An incident monochromatic
X-ray beam from the Al Kα target (15 kV, 10 mA) was focused on a
0.4-mm × 0.3-mm area of the surface at a 45° angle with respect to the
sample surface. Depth profile etching with an etch cycle of 30 s and a
total of 10 levels yielded high-resolution spectra. The electron energy
analyser perpendicular to the sample surface was operated with a pass
energy of 50 eV to obtain XPS spectra at a 0.1-eV step size and a dwell 
time of 50 ms. Each peak was scanned 15 times. To ensure representa- 
tive data from heterogeneous samples, we probed a total of 50-80 
points per sample. XPS data were treated and analysed using the CasaXPS
curve resolution software package. Spectra were fitted by nonlinear
least squares in CasaXPS after Shirley background subtraction.
Gaussian and Lorentzian contributions
to the line shapes were numerically convoluted using a Voigt function.
The different XPS lines with sets of Gaussian and Lorentzian peaks were
empirically fitted with different standards corresponding to different
oxidation states (MnO, MnCO₃, Mn₂O₃, Mn₃O₄, MnO₂ and MnCaO₃); each
manganese XPS spectrum was fitted with the combination of these standard
phases that produced the minimum residual. The average fit properties for
all treated spectra were acceptable: R_expected = 1.60,
R_profile = 1.71, significance level = 0.05, residual s.d. = 1.67, goodness of
fit = 1.78, critical χ² = 3.84.
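
The reported critical χ² of 3.84 at a significance level of 0.05 matches the χ² distribution with one degree of freedom. The number of degrees of freedom is not stated in the text, so it is an assumption here; the sketch below simply checks the arithmetic using the closed-form cumulative distribution for one degree of freedom, with hypothetical function names.

```python
import math


def chi2_cdf_df1(x):
    """CDF of the chi-square distribution with 1 degree of freedom.

    If Z is standard normal, P(Z**2 <= x) = erf(sqrt(x / 2)).
    """
    return math.erf(math.sqrt(x / 2.0)) if x > 0 else 0.0


def chi2_critical_df1(alpha=0.05, tol=1e-10):
    """Critical value c with P(X > c) = alpha, found by bisection."""
    lo, hi = 0.0, 100.0
    target = 1.0 - alpha
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if chi2_cdf_df1(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


if __name__ == "__main__":
    print(f"critical chi-square (df = 1, alpha = 0.05): {chi2_critical_df1():.2f}")
```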


Interpretation of XPS spectra. The redox state of manganese in micro- 
bial cultures was confirmed by XPS (Extended Data Fig. 3). The Mn2p 
XPS spectra of the dark culture exhibited two major peaks at binding
energies of 640.90 eV and 652.2 eV, which correspond to Mn2p3/2
and Mn2p1/2, respectively. This is consistent with other reports on
Mn(II) phases of Mn. In the photosynthesizing culture, the Mn2p peak
shifted to the high-energy side and the intense satellite peak characteristic
of Mn(II) diminished. These biofilms contained manganese in
different valence states. At some analysed spots, the Mn2p XPS spectrum
exhibited two major peaks of Mn2p3/2 and Mn2p1/2 at binding
energies of 642 eV and 653 eV, respectively. These correspond to Mn(IV)
in calcium-manganese oxide phases. Peaks at Mn2p3/2 with binding
energies of 641.61 eV and 641.47 eV were also detected.
These peaks correspond to Mn(III) in Mn₂O₃ and to Mn(III) and Mn(II)
in Mn₃O₄ phases, respectively.

The redox state of the manganese in the cultures enriched in condition
(1) and condition (2) was confirmed by XPS (Extended Data Fig. 4).
The Mn2p XPS spectra of the condition (1) enrichment (Extended Data
Fig. 4a) exhibited two major peaks at binding energies of 641.41 eV and
653.15 eV, corresponding to Mn2p3/2 and Mn2p1/2, respectively, and matching
the peaks of Mn₃O₄. The Mn2p XPS spectra of the condition (2)
enrichment (Extended Data Fig. 4b) exhibited peaks at 640.97
eV and 652.2 eV, which correspond to Mn2p3/2 and Mn2p1/2, respectively.
This is consistent with other reports of MnO phases.
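
The separation between the Mn2p3/2 and Mn2p1/2 peaks follows directly from the binding energies quoted in this section. The sketch below is plain arithmetic on those reported values; the sample labels and the function name `doublet_splitting` are illustrative, and no interpretation beyond the subtraction is implied.

```python
# (Mn2p3/2, Mn2p1/2) binding energies in eV, as quoted in the text.
DOUBLETS = {
    "dark culture": (640.90, 652.2),
    "photosynthesizing culture (some spots)": (642.0, 653.0),
    "condition (1) enrichment": (641.41, 653.15),
    "condition (2) enrichment": (640.97, 652.2),
}


def doublet_splitting(p3_2, p1_2):
    """Energy separation (eV) between the two components of the Mn2p doublet."""
    return round(p1_2 - p3_2, 2)


if __name__ == "__main__":
    for label, (p3_2, p1_2) in DOUBLETS.items():
        print(f"{label}: {doublet_splitting(p3_2, p1_2):.2f} eV")
```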


Probing the redox state of manganese in biofilms. We used XPS to 
detect oxidized manganese in colonies enriched on 1 mM Mn(II) (condition
(1)). Extended Data Table 3 summarizes the procedure used to study
the Mn(II) oxidation activity in the enrichment cultures. Manganese
oxidation was tested using cultures that were enriched as colonies in
agar shake tubes (condition (1) and condition (2)) (see Extended Data
Table 3 and ‘Spectroscopy’ and ‘Interpretation of XPS spectra’). Biofilms
from condition (1) were grown in duplicate 10-ml cultures with 1 mM
MnCl₂ and 0.02 mM Na₂S, mechanically dispersed and resuspended
into separate 10-ml liquid solutions. Five per cent v/v of this suspension
was transferred into 10 ml of MFGL medium with 1 mM MnCl₂ and 0.02
mM Na₂S and the cultures were incubated at 27 °C for 1 week before
the assay. A second assay tested the Mn(II) oxidation activity without
requiring the very sparse biofilm to grow. The condition (1) enrichment
was grown for two weeks as described in ‘Probing the redox state of
manganese in biofilms’, centrifuged at 8,000 rpm in the anaerobic
chamber, washed 3 times with anaerobic water and transferred into
10 ml of MFGL with 1 mM MnCl₂ and 0.02 mM Na₂S. These cultures
were incubated for 3 days in a 12:12 h light:dark regime and collected 
anaerobically. This procedure preserved the cell density of the original 
biofilms and did not require microbial growth. 

To test for Mn(II) oxidizing activity in the enrichment from
condition (2) (Chlorobium sp. and Desulfomicrobium sp.), the culture was
grown in 10 ml of the basal MFGL amended with 1 mM Na₂S, collected
anaerobically and dispersed in 10 ml of the basal MFGL medium. This
suspension was used to inoculate 10 ml of the basal MFGL amended
with 1 mM MnCl₂ and 0.02 mM Na₂S at 5% v/v. The inoculated medium
was incubated in the light/dark regime for one week. To test for Mn(II)
oxidizing activity without requiring the low-biomass biofilms to grow,
the enrichment from condition (2) was grown for 2 weeks in 10 ml of
the basal MFGL amended with 1 mM Na₂S, centrifuged at 8,000 rpm
in the anaerobic chamber, washed 3 times with anaerobic water and
transferred to 10 ml of MFGL with 1 mM MnCl₂ and 0.02 mM Na₂S. These
cultures were incubated for 3 days in the 12:12 h light:dark regime and
collected anaerobically. All collected microbial pellets were dried on
carbon tape and stored anaerobically inside serum bottles with an N₂
atmosphere and placed inside the anaerobic chamber in the dark at 
26 °C until XPS analysis. 


Probing the redox state of manganese in co-cultures. C. limicola 
(DSM 245, DSMZ) and C. tepidum (DSM 12025, DSMZ) were inoculated 
with 5% v/v inoculum and grown in 50 ml MFGL medium supplemented 
with 0.05 mM Na₂S and 0.5 g/l yeast extract in a 12:12 h light:dark 
regime at 27 °C for 3 weeks. G. lovleyi (DSM 17278, DSMZ) was grown in 
MFGL supplemented with 2.8 mM ferrihydrite and 5 mM acetate and 
reduced with 0.05 mM Na₂S in the dark at 21 °C for 3 weeks. 
Microorganisms from these cultures were inoculated as 5% v/v inoculum 
in the following combinations: C. limicola + C. tepidum, C. tepidum + 
G. lovleyi, C. limicola + G. lovleyi and C. limicola + C. tepidum + G. lovleyi. 
All these co-cultures were grown in 10 ml MFGL medium with 1 mM Mn(II) 
and 0.05 mM Na₂S in a 12:12 h light:dark regime at 27 °C for 2 weeks. The 
biomass was collected anaerobically, the pellets were dried on carbon 
tape and stored anaerobically under N₂ inside the anaerobic chamber in 
the dark at 26 °C until XPS analysis. The oxidation state of manganese 
was characterized by XPS in all four cultures. 


X-ray powder diffraction 
XRD patterns were obtained in reflection mode with Ni-filtered Cu Kα 
radiation (λ = 1.5406 Å) as the X-ray source on an X’Pert PRO diffractometer 
(XRD, X’Pert PRO, PANalytical) equipped with an X’Celerator detector 
(PANalytical). The patterns were measured in the 2θ range from 3° to 90° 
with a scanning step of 0.008° and a fixed counting time of 600 s at 
45 kV and 40 mA. Biofilms were collected and centrifuged at 14,000 rpm 
for 5 min in the anaerobic chamber. Microbial paste was smeared on a 
Zero Diffraction Disk (23.6-mm diameter × 2.0-mm thickness, Si crystal, 
MTI) and dried in the anaerobic chamber. The samples were analysed 
inside the anaerobic dome to maintain the anoxic conditions during 
the XRD analyses. Data were analysed and fitted using High Score Plus 
program version 4.5. The average fit properties for all treated spectra 
were acceptable, as follows: residual s.d. = 1.63, R(expected) = 1.28, 
R(profile) = 1.63, significance level = 0.05, goodness of fit = 1.69, 
critical χ² = 3.84. 
Precipitated minerals were also analysed using in-situ synchrotron-based 
XRD at the Advanced Light Source at beamline 12.3.2. Biofilms 
were collected on site and the biofilm paste was loaded into transmission 
sample XRD cells. The transmission synchrotron diffraction data 
were collected using a DECTRIS Pilatus 1M hybrid pixel area detector 
placed at an angle 2θ of 35° at approximately 170 mm from the sample. 
The 4-bounce monochromator was set to an energy of 10 keV 
(λ = 1.239842 Å). The sample geometry with respect to the incident beam 
and the detector was carefully calibrated using Al₂O₃ powder. The 2D 
diffraction patterns (Fig. 1d) were analysed and integrated along the 
azimuthal direction into 1D diffractograms using the X-ray microdiffraction 
analysis software (XMAS v6) developed at the Advanced Light 
Source for beamline 12.3.2, and Matlab R2017a. 
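
The synchrotron wavelength quoted above follows from the photon energy through λ = hc/E, and a diffraction angle can be converted to an interplanar spacing with Bragg's law. The short sketch below illustrates both relations. It is not the analysis code used in the study (XMAS and Matlab are named for that); the function names are hypothetical, first-order diffraction (n = 1) is assumed, and the example angle is the dolomite (104) position reported in Extended Data Fig. 2.

```python
import math

# h*c expressed in keV·Å, so that wavelength (Å) = HC_KEV_ANGSTROM / energy (keV).
HC_KEV_ANGSTROM = 12.39842


def wavelength_angstrom(energy_kev: float) -> float:
    """Photon wavelength in ångströms for a photon energy in keV."""
    return HC_KEV_ANGSTROM / energy_kev


def d_spacing_angstrom(two_theta_deg: float, wavelength: float) -> float:
    """Interplanar spacing from Bragg's law, assuming first order (n = 1)."""
    theta = math.radians(two_theta_deg / 2.0)
    return wavelength / (2.0 * math.sin(theta))


if __name__ == "__main__":
    # 10 keV monochromator setting quoted for the synchrotron measurements.
    print(f"lambda at 10 keV: {wavelength_angstrom(10.0):.6f} Å")
    # Dolomite (104) reflection at 2θ = 30.870° with Cu Kα (1.5406 Å).
    print(f"d(104) dolomite: {d_spacing_angstrom(30.870, 1.5406):.3f} Å")
```

With these inputs the 10 keV setting reproduces the stated wavelength, and the dolomite reflection corresponds to a spacing of roughly 2.9 Å.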


Determination of XRD detection limit. To determine the detection 
limit of XRD, 0.05, 0.01, 0.02, and 1 mg of MnO₂ was mixed with 10 mg 
of dry anaerobic biofilm that did not contain green sulfur bacteria or 
manganese oxides and spread on a Zero Diffraction Disk (23.6-mm diameter 
× 2.0-mm thickness, Si crystal, MTI). The mixtures were analysed 
using an X’Pert PRO diffractometer (XRD, X’Pert PRO, PANalytical) equipped 
with an X’Celerator detector (PANalytical) over 10-h analysis time. The MnO₂ 
standard and the bacterial biofilm were also run separately as controls. 
The detection limit of XRD was determined by the mass of MnO₂ that 
yielded discernible diffraction peaks in the XRD spectrum. 
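
For orientation, the spiked masses can be expressed as weight fractions of the total mixture. The sketch below is only an illustration of that arithmetic, with a hypothetical helper name; it assumes each spike was added to exactly 10 mg of dry biofilm, as the text states, and makes no claim about which mixture produced discernible peaks.

```python
def weight_percent(analyte_mg: float, matrix_mg: float) -> float:
    """Analyte mass as a percentage of the total (analyte + matrix) mass."""
    return 100.0 * analyte_mg / (analyte_mg + matrix_mg)


BIOFILM_MG = 10.0                    # dry biofilm mass per mixture, from the text
SPIKES_MG = [0.05, 0.01, 0.02, 1.0]  # spiked oxide masses, in the order listed

if __name__ == "__main__":
    for mass in SPIKES_MG:
        print(f"{mass:g} mg spike -> {weight_percent(mass, BIOFILM_MG):.2f} wt%")
```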


Interpretation of XRD peaks. The XRD spectra of microbial cultures 
incubated in the light with Mn(II) (Fig. 2) showed peaks that can 
be indexed to a ternary manganese oxide, CaMnO₃ (ICDD-01-016-2217), 
with lattice constants of a = 5.2917 nm, b = 7.4803 nm and c = 
5.2870 nm. CaMnO₃ is not known to occur naturally. 

Dolomite was the most abundant phase in the cultures and its peaks 
were indexed as (104), (101), (110), (11-3), (202) and (018) (ICDD-04-011-9833). 
The absence of light inhibited the growth of photosynthetic 
microorganisms and the formation of manganese-oxide minerals, and 
also reduced the precipitation of dolomite (Fig. 2). Biofilm incubated in 
the dark showed the precipitation of a calcium carbonate phase, CaCO₃ 
(ICDD-00-058-0471), indexed for (121) and (102). In addition to the 
various carbonate phases, the XRD spectrum showed two different 
phases of elemental sulfur (S⁰): (ICDD-04-020-2294) indexed for (110), 
(-101) and (-211), and (ICDD-05-001-0219) indexed for (110) and (101). 

The XRD spectra of microbial cultures incubated at different concentrations 
of manganese and sulfur showed peaks of manganese oxide, 
dolomite and elemental sulfur (Extended Data Fig. 2). The latter formed 
in microbial cultures that were incubated with 0.25–1 mM H₂S and in 
the cultures grown with less than 1 µM Mn(II) (see the composition of 
the trace metal solution in ‘Enrichment and culturing conditions’). S⁰ 
(ICDD-05-001-0219) was indexed for (110), (-101), (011) and (-211). 


Microscopy 

Scanning electron microscopy. Scanning electron micrographs 
were acquired by a Zeiss Merlin scanning electron microscope with 
the GEMINI II column (Zeiss Merlin SEM, Carl Zeiss microscopy). The 
microscope was equipped with a field emission gun and an energy-dispersive 
X-ray spectrometer (EDS, EDAX detector; EDAX) that operated at an 
accelerating voltage of 5–15 kV, probe current of 100 pA and a working 
distance of 8.5 mm. An on-axis in-lens secondary electron detector 
was used during imaging. The samples were fixed by 0.2 M sodium 
cacodylate, 0.1% CaCl₂ and 2.5% glutaraldehyde in anaerobic water 
for 2–3 days at 4 °C. The fixed samples were washed by 0.1 M sodium 
cacodylate followed by a wash in nanopure water. After washing, the 
samples were dehydrated in a series of ethanol–water solutions. The 
ethanol–water solution series included the following dehydration 
steps: 30% (20 min), 50% (20 min), 70% (20 min), 80% (20 min), 90% 
(20 min) and 100% (3 × 20 min) of 200 proof ethanol. After air-drying, 
the samples were mounted on double-sided carbon tape and coated 
with a thin layer (5 nm) of Au/Pd or 10 nm of carbon using a Hummer V 
sputter coater. EDS spectra were treated and analysed by TEAM EDS 
2.0 analysis software (EDAX) and Microsoft Excel 2016. 


TEM. Transmission electron micrographs were obtained using an FEI 
Tecnai F20 supertwin microscope (FEI Tecnai G2, FEI) with a 200-kV 
Schottky field emission gun. The samples were imaged at 80 kV with a 
1,024 × 1,024 CCD Gatan camera (Gatan). The samples were fixed by 
0.2 M sodium cacodylate, 0.1% CaCl₂·6H₂O and 2.5% glutaraldehyde 
in aerobic nanopure water for 2–3 days at 4 °C. The samples were then 
washed with washing buffer (0.1 M sodium cacodylate in nanopure 
water), postfixed with 1% osmium tetroxide in water for 1 h, washed with 
aerobic nanopure water and stained with 1% uranyl acetate for 1 h. The 
stained samples were washed with nanopure water and dehydrated in 
an ethanol–water solution series. The ethanol–water solution series 
included the following dehydration steps: 30% (20 min), 50% (20 min), 
70% (20 min), 80% (20 min), 90% (20 min), and 100% (3 × 20 min) of 
200 proof ethanol. The samples were further dehydrated with propylene 
oxide:ethanol solvent (50:50, by vol) for 30 min, then with 100% 
propylene oxide. The epoxy resin used for embedding consisted of 
diglycerol ether of polypropylene glycol (EmBed 812, DER 736, Electron 
Microscopy Sciences, EMS no. 14130), cycloaliphatic epoxide resin (ERL 
4221, Electron Microscopy Sciences, EMS no. 14300), nonenyl succinic 
anhydride (NSA, Electron Microscopy Sciences, EMS no. 14300) and 
2-(dimethylamino)ethanol (DMAE, Electron Microscopy Sciences, EMS 
no. 14300). The samples were embedded in resin and cut into 80-nm-thick 
sections with a diamond knife using a Leica Reichert Ultracut E 
microtome (Reichert Ultracut E microtome, Leica) with a thickness 
setting of 50 nm. Thin sections were placed on FCF-200 grids (Electron 
Microscopy Sciences, FCF-200-Cu). 

To determine whether the fixation and embedding protocols introduced 
any artefacts, photosynthetic biofilms were also collected without 
any further processing or staining in the anaerobic chamber. A drop 
of microbial culture was deposited on an LC-200 grid (Electron Microscopy 
Sciences, LC-200-Cu) and imaged with a JEOL 2010F TEM (JEOL 2010F, 
JEOL). The JEOL 2010F TEM is equipped with a Schottky field emission 
gun operating at 200 kV and a Gatan energy filter (GIF, Gatan 200, 
Gatan). The 2010F TEM has micro-diffraction, diffraction pattern in 
parallel beam and convergent beam electron diffraction features to 
allow SAED on selected mineral-encrusted bacteria with a high spatial 
resolution. A gold standard was used as reference for SAED analyses. 
The high-angle annular dark field detector (Gatan) for atomic-resolution 
scanning transmission electron microscopy in the free-lens control 
mode (STEM) and with an EDS (Bruker silicon drift detector SDD, 
Bruker) enabled elemental analysis at nanoscale resolution. Images in 
the TEM and STEM mode were taken by a digital camera (Gatan Orius, 
Gatan). SAED patterns were imaged using a Gatan digiscan unit (Gatan). 
TEM, STEM and SAED images were recorded and treated using Gatan 
digital micrograph software (Gatan). EDS spectra were recorded and 
treated using the INCA program (Oxford Instruments). 


Interpretation of SAED patterns. Different types of manganese minerals 
in photosynthetic biofilms corresponded to different stages of 
mineral maturation. High-resolution TEM of the manganese oxide 
nanocluster surrounding a cell (Fig. 3) showed polycrystalline minerals 
with a uniform lattice fringe that corresponded to the (116) plane with 
interplanar spacing of 2.71 Å of calcium manganese oxide (ICDD-00-053-0092). 
The SAED patterns of minerals that were not associated 
with cell surfaces showed various minerals. One type of manganese 
mineral had four obvious polycrystalline diffraction rings that could 
be observed at 3.65 Å, 3.40 Å, 2.88 Å and 1.83 Å. These corresponded, 
respectively, to the (112), (211), (220) and the (323) crystal planes of 
Mn,0, (ICDD-03-065-2776) (Fig. 3). Some globular nanocrystals of 
manganese oxide outside of any microbial surfaces (Fig. 3) showed 
lattice fringes with the interplanar spacing of 2.26 Å. This matched the 
characteristic interplanar spacing of the (200) plane of manganese 
oxide type MnO mineral (ICDD-04-004-3858). 


Concentrations of dissolved species in culture media 

Sulfide concentrations were determined using a modification of a previously 
published method in samples of triplicate cultures for each time 
point. In brief, 200 µl of each liquid sample was diluted in 1 ml of 0.05 M 
zinc acetate. Standards were prepared from 1 mM anaerobic stock solution 
of Na₂S diluted by 0.05 M zinc acetate. The concentration of the Na₂S 
stock solution was verified by precipitating an exact volume of Na₂S 
with an excess volume of 0.3 M silver nitrate. Six hundred microlitres 
of the precipitated sample were transferred and reacted with 10 µl of 
diamine reagent. After 20 min reaction time in the dark, the absorbance 
was measured by a multi-mode reader spectrophotometer (BioTek, 
Synergy 2) at 670 nm. 

The concentrations of sulfate, nitrite and nitrate in the samples of 
the liquid medium from triplicate cultures were determined by ion 
chromatography (Dionex ICS-16000 equipped with an auto-sampler 
Dionex AS-DV, Thermo Fisher), guard column (Dionex Ion Pac AG22, 
RFIC, Guard 2 × 50 mm, Thermo Fisher), analytical column (Dionex Ion 
Pac AS22, RFIC, Analytical 2 × 250 mm, Thermo Fisher, USA) and a trap 
column for metals (Dionex Ion Pac MFC-1, RFIC, trap column, metal 
free, 3 × 27 mm, Thermo Fisher). All samples were filtered anaerobically 
through 0.2-µm-pore-size filters (Acrodisc 25-mm syringe filter, 
PALL) and stored at −20 °C. The chloride ion was solid-phase extracted 
from all samples using a Ag/H cartridge (Dionex OnGuard II Ag/H, 
2.5 cc cartridge, Thermo Fisher) before the analysis. The removal of 
the chloride ion affected the lower detection limit for phosphate, but 
not for sulfate and nitrate. The limits of detection for sulfate, nitrate 
and nitrite were 20 µg/l, 20 µg/l and 10 µg/l, respectively. 

Total dissolved manganese concentrations in the liquid culture 
medium from triplicate cultures were determined by inductively coupled 
plasma-mass spectrometry (ICP-MS, Agilent 7500, Agilent). All 
samples were filtered through 0.2-µm-pore-size filters (Acrodisc 25-mm 
syringe filter, PALL), acidified with 2% high-purity HCl (hydrochloric 
acid 30%, Sigma Aldrich, Suprapur, EMD Millipore, 100318) and stored 
at −20 °C. All samples were diluted with high-purity 2% HCl (hydrochloric 
acid 30%, Sigma Aldrich, Suprapur, EMD Millipore, 100318) 
before the analysis. 

Dissolved manganese in the liquid phase was also measured by the 
leucoberbelin blue (LBB) assay and the iodometric method. Oxidized 
manganese in the liquid phase includes any soluble valence state of manganese 
that can pass through the 0.2-µm pore filter. We used the iodometric 
method to determine manganese oxidation state including Mn(II), 
Mn(III), Mn(IV) and Mn(VII). Again, none of the measurements detected 
oxidized manganese in the liquid phase. The LBB assay was also used 
to quantify the concentrations of oxidized manganese in biofilms. 
These concentrations were detected as oxidizing equivalents of KMnO₄ 
(‘Quantification of biofilms and oxidized manganese in biofilms’). 

The concentration of peroxide was measured in triplicate samples 
of microbial biofilms and sterile controls incubated in the light with 1 
mM Mn(II) and 50 µM Na₂S using a peroxidase activity assay kit (Sigma 
Aldrich, MAK092). The standard curve was measured using different 
dilutions of the H₂O₂ standard (Sigma Aldrich, MAK092C) in sterile 
culture medium mixed with the reaction mix composed of 2 µl fluorescent 
peroxidase substrate (Sigma Aldrich, MAK092B) and 48 µl of HRP 
positive control (Sigma Aldrich, MAK092D). A 100-µl aliquot of each diluted 
H₂O₂ standard and of each sample was distributed into microplate wells. 
The plate was incubated at 37 °C and the initial measurement (A_initial) 
was taken after 3 min by a multi-mode reader spectrophotometer 
(BioTek, Synergy 2) at 570 nm. The absorbance was measured every 
3 min until the value of the most-active sample exceeded the end of the linear 
range of the standard curve. We did not detect any H₂O₂ in the incubations 
or sterile controls. The limit of detection of H₂O₂ using colorimetric 
detection was 0.1 nM. 


Quantification of biofilms and oxidized manganese in biofilms 

The amount of biofilm was measured by crystal violet staining using a 
modification of a previously published assay. In brief, biofilms from 
triplicate serum bottles were collected at each time point and centrifuged 
aerobically at 10,000 rpm for 30 min. The supernatant was 
decanted and 0.5 ml of 0.1% aqueous crystal violet was added (crystal 
violet, Sigma Aldrich, ACS reagent, > 90% anhydrous basis, C6158). The 
stained biofilm was incubated in the dark at room temperature for 24 h, 
washed 15 times with nanopure water and air-dried. After drying, 0.5 ml 
of 30% acetic acid (37% acetic acid, Sigma Aldrich, ACS reagent, > 99.7%, 
695092) was added to the samples and left to react at room temperature 
for 30 min. Acetic acid solubilized all crystal violet molecules bound 
to peptidoglycan and exopolysaccharide. Thus, the solubilized crystal 
violet corresponds to the biomass in biofilms. The collected solubilized 
crystal violet was filtered through 0.2-µm-pore-size filters (Acrodisc 
25-mm syringe filter, PALL) and 200 µl of the solution was transferred 
into a microtiter plate. The absorbance of the samples was measured 
at 550 nm using a spectrophotometer (BioTek, Synergy 2). 

Oxidized manganese in biofilms was quantified by the LBB assay. 
Biofilms were inoculated from frozen stocks into triplicate serum bottles 
that contained 25 ml or 50 ml of MFGL medium and incubated for 
one, two or three weeks. Biofilms from the frozen stock (about 5 mg) 
were washed 10 times with anaerobic nanopure water to remove any 
glycerol, inoculated into the culture medium and the medium was 
immediately flushed with 20% CO₂/80% N₂ (v/v) for 1 h. The biofilms 
grew for 2 weeks in the light, at which point, biofilms (about 5 mg) were 
transferred into serum bottles that each contained 25 ml of the fresh 
MFGL medium. Three bottles were incubated in the light, three in the 
dark and all biofilms were collected after one or two weeks by pipetting 
and centrifugation in the anaerobic glove box. After the LBB assay, all 
analysed samples were air-dried for >24 h and weighed. The one-week-old 
biofilms weighed 16–18 mg; the two-week-old biofilms weighed 
19–21 mg. A separate experiment quantified the amount of oxidized 
manganese in duplicate 25-ml cultures of 3-week-old biofilms that had 
been inoculated with about 0.1 mg of the washed material from frozen 
culture stocks and weighed <0.3 mg at the end of the experiment. In 
contrast to the experiments that yielded samples for XRD, XPS, scanning 
electron microscopy and TEM analyses, these experiments involved 
at most one successive transfer of biofilms after the inoculation from 
frozen stocks of biofilms enriched as described in ‘Enrichment and 
culturing conditions’. 

The working reagent was prepared as 0.04% LBB in 45 mM acetic 
acid and stored at 4 °C overnight in a light-proof container. Potassium 
permanganate (KMnO₄) 1 mM stock solution was freshly prepared in 
water and standards (5, 10, 15, 20, 40 and 50 µM) were prepared by 
diluting the stock solution in water. The samples were incubated for 
20 min in 0.75 ml of the 0.04% LBB working reagent in the dark at room 
temperature and centrifuged for 90 s at 10,000 rcf to remove the biofilm 
and mineral particulates from the solution. The absorbance of 
the supernatant was measured on a spectrophotometer at 618 nm. To 
determine whether some manganese was oxidized in the dark, about 
5 mg of the biofilm stock was inoculated into sterile MFGL and incubated in 
the dark for 2 weeks; the assay detected on average 0.02 µM oxidizing equivalent 
per 5 mg of biofilm. Control experiments assayed the concentration of 
oxidized manganese in FGL enrichment cultures that contained 1 mM 
sulfide and 1 mM MnCl₂, and did not detect any oxidized manganese. 
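
Converting an LBB absorbance reading into oxidizing equivalents requires a calibration line through the KMnO₄ standards. The sketch below shows one way to do this with an ordinary least-squares fit. It is an illustration under stated assumptions, not the procedure documented in the source: the function names are hypothetical, a linear response over 5–50 µM is assumed, and the absorbance values are invented placeholders (generated from an arbitrary straight line) rather than measured data.

```python
from typing import Sequence, Tuple


def fit_line(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def oxidizing_equivalents_um(absorbance: float, slope: float, intercept: float) -> float:
    """Invert the calibration line to get KMnO4 oxidizing equivalents in µM."""
    return (absorbance - intercept) / slope


# Standard concentrations listed in the text (µM KMnO4).
STANDARDS_UM = [5, 10, 15, 20, 40, 50]
# Placeholder absorbances at 618 nm: NOT measured values, purely illustrative.
PLACEHOLDER_A618 = [0.02 * c + 0.01 for c in STANDARDS_UM]

slope, intercept = fit_line(STANDARDS_UM, PLACEHOLDER_A618)

if __name__ == "__main__":
    sample_a618 = 0.41  # hypothetical sample reading
    print(f"{oxidizing_equivalents_um(sample_a618, slope, intercept):.1f} µM equivalents")
```

Readings that fall outside the range spanned by the standards would need dilution or additional standards before the line is applied.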


Oxygen concentration 
To determine how much oxygen can diffuse into the cultures 
through the butyl rubber caps, we used our in-house-developed 
oxygen sensor based on the fluorescence lifetime of 
5,10,15,20-tetrakis(pentafluorophenyl)-21H,23H-porphine palladium(II). 
The sensor can detect changes in the partial pressure of oxygen that 
are smaller than 1 µbar and its main sensitivity region is 0–100 µatm. 
Experiments were conducted to quantify the oxygen concentration 
in the cultures and the maximum amount of oxygen inflow. First, the 
partial pressures of oxygen in the headspaces of photosynthetic cultures, 
sterile controls incubated in the light and dark control cultures 
were measured automatically for 14 days. All serum bottles contained 
100 ml of the medium reduced by 50 µM Na₂S (Extended Data Fig. 1). 
The partial pressure of oxygen in the headspaces of the bottles did not 
increase or fluctuate by more than 2 µbar over the course of the growth 
experiment. The main sources of noise were daily thermal fluctuations 
(high-frequency component) and sensor aging (low-frequency 
component). The upper limit for the partial pressure of oxygen in the 
headspace is 2 µbar, measured at the beginning of the experiment. 
This partial pressure was lower than 0.5 µbar during most of the experiment. 
The upper limit corresponds to a maximum dissolved molecular oxygen 
concentration of 2.6 nM, assuming the equilibrium between O₂ in 
the headspace and O₂ dissolved in the culture medium according to 
Henry’s law. 
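
The conversion from headspace partial pressure to dissolved concentration can be checked with Henry's law, c = k_H × p. The sketch below is a minimal illustration: the source does not state which Henry's law constant was used, so the value of about 1.3 × 10⁻³ mol l⁻¹ atm⁻¹ for O₂ in water (a common textbook figure near 25 °C) is an assumption, as are the function and constant names.

```python
BAR_PER_ATM = 1.01325

# Assumed Henry's law constant for O2 in water near 25 °C, in mol/(L·atm).
# The source does not give the constant it used; this is a textbook value.
KH_O2_MOL_PER_L_ATM = 1.3e-3


def dissolved_o2_nm(p_o2_microbar: float, kh: float = KH_O2_MOL_PER_L_ATM) -> float:
    """Equilibrium dissolved O2 (nM) for a headspace partial pressure in µbar."""
    p_atm = p_o2_microbar * 1e-6 / BAR_PER_ATM  # µbar -> atm
    return kh * p_atm * 1e9                     # mol/L -> nmol/L


if __name__ == "__main__":
    for p in (2.0, 0.5):
        print(f"{p} µbar O2 -> {dissolved_o2_nm(p):.2f} nM")
```

Under this assumed constant, the 2 µbar upper limit gives about 2.6 nM, in line with the value stated in the text, and 0.5 µbar gives well under 1 nM.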

In an additional test, we incubated biofilms in the light in the anaerobic 
chamber under a 5% CO₂/5% H₂/balance N₂ (v/v/v) atmosphere. The partial 
pressure of oxygen in the chamber was below 1 ppm, as opposed to 21% 
above the butyl rubber stoppers of the cultures that were incubated 
outside of the chamber. Therefore, orders-of-magnitude-less oxygen 
is expected to diffuse into the cultures. The biofilms were grown with 
and without the addition of 1 mM Mn(II) and the culture medium was 
reduced with 20 µM Na₂S. After two weeks of incubation, the biofilms 
were collected and analysed by XRD. Manganese oxide and carbonate 
phases formed in biofilms incubated with Mn(II), and elemental sulfur 
formed in biofilms grown without Mn(II). The formation of detectable 
quantities of manganese oxides in photosynthetic cultures incubated 
in the anaerobic glove box further demonstrated the negligible role of 
oxygen diffusion in the oxidation of manganese. 


Acquisition of phototrophy in green sulfur bacteria 

Phototrophy within stem-group green sulfur bacteria (GSB) and stem-group 
green non-sulfur bacteria (GNS) could have been acquired at any 
point before their post-GOE diversification events. Without additional 
information, it is not possible to infer where along these branches 
phototrophy was acquired, but the evolutionary history of bacteriochlorophyll 
biosynthesis may provide a strong clue. Phylogenies of protein 
families involved in bacteriochlorophyll biosynthesis have a complex 
evolutionary history across phototrophic lineages, including gene 
duplications within the stem-group GSB, and multiple horizontal gene 
transfer events between GSB and GNS lineages. Specifically, the genes 
encoding BchH and BchM were transferred from within crown-group 
GNS to stem-group GSB, with the gene encoding BchH undergoing a 
duplication shortly before crown-group GSB. BchI is also observed to 
duplicate in the GSB stem, with one paralogue being transferred to stem 
GNS. These observations indicate that phototrophy must have existed 
in these lineages at the time of any bacteriochlorophyll-synthesis-gene 
duplications, or any divergence of a horizontal-gene-transfer 
donor lineage. A substantial history of phototrophy within the GSB 
stem lineage can be inferred from these events (Extended Data Fig. 6). 
Future molecular clock studies that include these gene tree histories 
may be able to constrain the time interval for phototrophy in the GSB 
stem; but the Bch protein histories alone suggest that phototrophy 
within GSB existed earlier than the appearance of the GSB or GNS crown 
groups. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Sequence data are available as FASTQ files at the National Center for Bio- 
technology Information (NCBI) via Sequence Read Archive (SRA), under 
the SRA accession number SRP133329. The datasets that support the 
findings of this study are available from the figshare repository (https:// 
figshare.com/), with the identifiers 10.6084/m9.figshare.9738515, 
10.6084/m9.figshare.9738725, 10.6084/m9.figshare.9738776, 10.6084/ 
m9.figshare.9738905, 10.6084/m9.figshare.9738797, 10.6084/ 
m9.figshare.9738878, 10.6084/m9.figshare.9738887, and 10.6084/ 
m9.figshare.9738896. All other supporting data that support the find- 
ings of this study are available from the corresponding authors. 
Extended Data Fig. 1 | Partial pressure of oxygen in the headspace of 
enrichment cultures and dark controls. Oxygen concentration (atm) 
measured in the headspaces of 150-ml serum bottles that contained 100 ml of 
MFGL medium, 50 µM sulfide and 1 mM MnCl₂. One inoculated culture was 
incubated in the light (red points) and another in the dark (blue points). The 
sterile control (green points) was incubated in the light. Individual points are 
measurements by the oxygen sensor taken every 48.2 s. To control for sensor 
drift and recalibrate the zero point of the sensor, the bottles were flushed with 
oxygen-free N₂ on day 14 (black arrow) after the inoculation. The fluorescence 
reading value after the stabilization was set as zero. The diurnal oscillations in 
O₂ concentration reflect temperature changes induced by the proximity to the 
light bulb with a 12:12 h day:night cycle. Oxygen concentrations in all cultures 
were lower than 1 nM at all times after about 12 h and before the flushing on 
day 14. All data are representative of two independent measurements. 
Extended Data Fig. 2 | XRD spectra of biofilm samples incubated in the light 
for two weeks. a–d, These biofilms were not treated to remove manganese 
oxides before inoculation. a, One millimolar Mn(II) and 0.05 mM Na₂S. b, Mn(II) 
at 0.1 mM, and 0.05 mM Na₂S. c, One millimolar Mn(II) and 1 mM Na₂S. d, One 
millimolar Mn(II) and 0.25 mM Na₂S. Purple line shows the highest intensity 
peak at 2θ of 30.870° for the basal reflection of (104) plane of dolomite, 
CaMg(CO₃)₂. Red line shows the highest intensity peak at 2θ of 33.867° for the 
basal reflection of (121) plane of CaMnO₃. Orange line shows the highest 
intensity peak at 2θ of 42.845° for the basal reflection of (110) plane of S⁰. 
All data are representative of three independent measurements. 
Extended Data Fig. 3 | XPS of the 2p spectral region of manganese in 
two-week-old microbial cultures. a, Biofilm incubated in the dark. The 
Mn2p3/2 main peak of the sample fits the MnO standard at binding energy of 
640.90 eV that corresponds to the redox state of Mn(II). b, Biofilm incubated in 
the light. The Mn2p3/2 main peak of the sample fits the Mn₃O₄ standard at 
binding energy of 641.47 eV that corresponds to Mn(III) and Mn(II). c, Biofilm 
incubated in the light (a different region to that shown in b and d). The Mn2p3/2 
main peak of the sample fits the Mn₂O₃ standard at binding energy of 641.61 eV 
that corresponds to Mn(III). d, Biofilm incubated in the light (a different region 
to that shown in b, c). The Mn2p3/2 main peak of the sample fits the MnO₂ standard 
at binding energy of 641.90 eV that corresponds to the redox state of Mn(IV). 
e, Biofilm incubated in the light (a different region to that shown in c, d). 
The Mn2p3/2 main peak of the sample fits the CaMnO₃ standard at binding 
energy of 642.50 eV that corresponds to Mn(IV). All data are representative of 
three independent measurements. 
Extended Data Fig. 4 | Test of manganese-oxidizing activity in cell 
suspensions of photosynthetic cultures enriched under two conditions. 
Results of XPS analysis of the 2p spectral region of manganese are shown. 
a, Culture enriched on 1 mM Mn(II) and 0.05 mM H₂S (condition (1)). The 
Mn2p3/2 main peak of the sample fits the Mn₃O₄ standard at binding energy of 
641.41 eV. This corresponds to Mn(II) and Mn(III). b, Culture enriched on 1 mM 
H₂S (condition (2)). The Mn2p3/2 main peak of the sample fits the MnO standard at 
binding energy of 640.97 eV and corresponds to the redox state Mn(II). A 
detailed experimental protocol is described in ‘Interpretation of XPS spectra’ 
in the Methods, and summarized in Extended Data Table 3. All data are 
representative of three independent measurements. 
Extended Data Fig. 5 | Test of manganese-oxidizing activity in cell 
suspensions of pure cultures and co-cultures of C. limicola, C. tepidum and 
G. lovleyi. a–d, Results of XPS analysis of the 2p spectral region of manganese 
are shown. a, C. limicola, C. tepidum and G. lovleyi. The Mn2p3/2 main peak of the 
sample fits the Mn₂O₃ standard at binding energy of 641.65 eV. This corresponds to 
a valence state of Mn(III). b, C. limicola and C. tepidum. The Mn2p3/2 main peak 
of the sample fits the MnCO₃ standard at binding energy of 640.55 eV and 
corresponds to the redox state Mn(II). c, C. limicola and G. lovleyi. The Mn2p3/2 
main peak of the sample fits Mn,O, standard at binding energy of 641.36 eV. 
d, C. tepidum and G. lovleyi. The Mn2p3/2 main peak of the sample fits the MnCO₃ 
standard at binding energy of 640.57 eV. A detailed experimental protocol is 
described in ‘Probing the redox state of manganese in co-cultures’ in the 
Methods. All co-cultures were grown with 1 mM Mn(II) and 0.05 mM H₂S for 
2 weeks in the light. All data are representative of three independent 
measurements. 


[Figure: two lineages, each annotated ‘phototrophy present’ along the stem. 
First lineage: stem age 2,500–3,000 Ma, crown age 1,650–2,450 Ma. Second 
lineage: stem age 2,250–2,850 Ma, crown age 1,650–1,950 Ma.] 

Extended Data Fig. 6 | Reticulate history of bacteriochlorophyll-biosynthesis 
genes supports a long history of phototrophy in the Chlorobiales stem 
lineage. Horizontal gene transfers and gene duplications of 
bacteriochlorophyll genes were taken from a previous publication. Age 
estimates for crown Chlorobi and GNS groups were taken from a previous 
publication. 




Extended Data Table 1 | Aquatic environments with H₂S and manganese in the photic zone 

Water body     H₂S (µM)   Mn(II) (µM)   Photic zone              Reference 
Lake A         230        140           Green Sulfur Bacteria    68 
Lake Vanda     240                      Unclear                  69 
Garrow Lake    20                       Green Sulfur Bacteria 
Sombre Lake    12                       Green Sulfur Bacteria 
Svetloe Lake   2                        Green Sulfur Bacteria 
Black Sea      2                        Green Sulfur Bacteria 
Green Lake     20–30                    Green Sulfur Bacteria 

Data are from previous publications (refs 68–73). 


Extended Data Table 2 | Shake tube and transfer procedures used to obtain enrichments from conditions 1 and 2 


Incubation Incubation Growth! Enrichment 
First round period, Transfer into period 
shake tube (days) Colonies liquid medium (days) 
20 uM NavS, 1 Risck anal Brown colony; - Condition | 
mM MnCl 2 dark brown ODE TUM Noes a 
mM MnCh 
Black, Riawachane-t At Condition 2 
1 mM NaS 30 white and oe 30 
mM NaS 
dark brown 
Growth! Enrichment Microbes 
Incubation present in 
Second round Incubation Transfer into period enrichment 
shake tube period (days) _ Colonies liquid medium (days) culture” 
+ Condition 1 = Chlorobium 
Brown colony; limicola, 
ies ari ; 30 pace 0.02 mM NabS, 1 30 Geobacter 
mM MnCh lovleyi, 
others" 
+++ Condition 2  Chlorobium 
Brown, Brown colony; 1 limicola, 
rae eo dark brown mM Nas. 20 Desulfomicro 


bium Sp. 


¹Growth of colonies transferred from shake tube to a liquid medium of the same chemical composition. +, low microbial growth; +++, high microbial growth.
²Microbial composition determined by metagenomic sequencing.
³A. equifetale, Alistipes sp. HGBS5, and C. oshimai.

Extended Data Table 3 | Summary of the methods used to examine the redox state of manganese in enrichment cultures


Enrichment condition   Transfer condition   Transfer MFGL medium composition   Incubation time (days)   Growth¹   Mn oxidation activity²
i Cell 0.02 mM NaS 
: z . a 
ponetion suspension ImM MnCl. d 
a Cell 0.02 mM Naps, 1 
Conainen? suspension mM MnCh 2 ; ; 
et 5% 0.02 mM Naos, 
Condition 1 inoculum ImM MnCl ‘ - 
ie 5% 0.02 mM Naos, 
Condition 2 inoculum ImM MnCl " - 7


¹+ or - indicates visible growth (+) of the co-culture, or not (-).

²Manganese oxidation in the enrichment cultures was examined using XPS; + or -, presence (+) or absence (-) of Mn(II)-oxidizing activity in the microbial co-culture.


Extended Data Table 4 | Mn(II)-oxidation genes with confirmed function compared using BLASTp v. 2.6.0+ against the
genome of C. limicola SR-12

Locus ID       Gene annotation                            Gene name   Organism                      % identity   e value   Bit score
PputGB1_3353   animal heme peroxidase                     mopA        Pseudomonas putida            46           1.0E-08   57.4
WP_007817484   animal heme peroxidase                     ahpL        Roseobacter sp. AzwK-3b       40           7.0E-06   48.5
WP_009209951   animal heme peroxidase                     mopA        Aurantimonas manganoxydans    47           1.0E-07   53.5
WP_006837219   multi-copper oxidase                       mnxG        Bacillus sp. strain SG-1      no hits
WP_076798083   multi-copper oxidase / bilirubin oxidase   boxA        Arthrobacter sp. QXT-31       no hits
AFL56752       multi-copper oxidase, type 2               cotA        Bacillus pumilus WH4          no hits
CAJ19378       multi-copper oxidase                       moxA        Pedomicrobium sp. ACM 3067    no hits
NP_745328      multi-copper oxidase / bilirubin oxidase   mcoA        Pseudomonas putida            no hits
EG12318        multi-copper oxidase                       cueO        Escherichia coli              no hits

All animal haem peroxidases also hit the haemolysin-type calcium-binding region in C. limicola (e = 3 x 10", bit score = 98.6). Data are from previous publications.


The emergence and spread of drug-resistant Plasmodium falciparum impedes global
efforts to control and eliminate malaria. For decades, treatment of malaria has relied
on chloroquine (CQ), a safe and affordable 4-aminoquinoline that was highly effective
against intra-erythrocytic asexual blood-stage parasites, until resistance arose in
Southeast Asia and South America and spread worldwide’. Clinical resistance to the
chemically related current first-line combination drug piperaquine (PPQ) has now
emerged regionally, reducing its efficacy”. Resistance to CQ and PPQ has been
associated with distinct sets of point mutations in the P. falciparum CQ-resistance
transporter PfCRT, a 49-kDa member of the drug/metabolite transporter superfamily
that traverses the membrane of the acidic digestive vacuole of the parasite® °. Here we
present the structure, at 3.2 Å resolution, of the PfCRT isoform of CQ-resistant,
PPQ-sensitive South American 7G8 parasites, using single-particle cryo-electron
microscopy and antigen-binding fragment technology. Mutations that contribute to
CQ and PPQ resistance localize primarily to moderately conserved sites on distinct
helices that line a central negatively charged cavity, indicating that this cavity is the
principal site of interaction with the positively charged CQ and PPQ. Binding and
transport studies reveal that the 7G8 isoform binds both drugs with comparable
affinities, and that these drugs are mutually competitive. The 7G8 isoform transports
CQ in a membrane potential- and pH-dependent manner, consistent with an active
efflux mechanism that drives CQ resistance’, but does not transport PPQ. Functional
studies on the newly emerging PfCRT F145I and C350R mutations, associated with
decreased PPQ susceptibility in Asia and South America, respectively’, reveal their
ability to mediate PPQ transport in 7G8 variant proteins and to confer resistance in
gene-edited parasites. Structural, functional and in silico analyses suggest that
distinct mechanistic features mediate the resistance to CQ and PPQ in PfCRT variants.
These data provide atomic-level insights into the molecular mechanism of this key
mediator of antimalarial treatment failures.


Recent reductions in the global malaria burden have stalled, and the 
situation in Southeast Asia is exacerbated by the emerging resistance to 
artemisinin-based combination therapies, including the bis-4-amino- 
quinoline partner drug PPQ”. A major determinant of drug resistance 
in malaria parasites is the P. falciparum transporter PfCRT>. Amino
acid substitutions in this protein comprise haplotypes that originated 
independently decades ago in several regions that were subject to 


intense drug pressure with CQ, the former gold-standard antimalarial 
drug. These include the five-amino acid 7G8 variant that dominates in 
South America and the Western Pacific, and the eight-amino acid Dd2 
variant that is prevalent in Southeast Asia (Extended Data Fig. 1). Recent 
extensive use of PPQ in Cambodia is suspected to have driven the rapid 
emergence of novel mutations in pfcrt, giving rise to the Dd2 isoform®®. 
These mutations have become widespread across Southeast Asia, where

Fig. 1 | Single-particle cryo-EM structure of PfCRT 7G8. a, PfCRT (PlasmoDB
PF3D7_0709000) is localized within the membrane of the digestive vacuole
(DV) of the P. falciparum intra-erythrocytic parasite, in which imported host
haemoglobin (Hb) is catabolized and toxic free haem is released. CQ and PPQ
are believed to concentrate in the digestive vacuole as protonated species
(CQ²⁺ and PPQ⁴⁺) that bind to haem and prevent its incorporation into non-toxic
haemozoin®. In CQ-R parasites, PfCRT is thought to efflux CQ out of the
digestive vacuole into the cytosol away from its haem target. b, The 3.2 Å
cryo-EM structure of PfCRT 7G8, with the 10 TM helices coloured in rainbow. The N
and C termini are labelled. Bottom, the structure of PfCRT 7G8 rotated 90°
compared to the top, as viewed from the digestive vacuole side. TM and
JM helices are numbered. c, Topology of PfCRT highlighting the inverted
antiparallel repeats of TM1-TM4 and TM6-TM9 (shown as grey trapezoids).
Disordered regions are shown as dotted lines. TM1-TM10 and JM1 and JM2 are
labelled. TM1-TM4 and TM6-TM9 surround the central cavity. d, Surface
representation of the electrostatic potential of the central cavity with red and
blue indicating negatively and positively charged residues, respectively. On
the right, a central slice through the structure (dotted lines) as an inset shows
the arrangement of TM helices, labelled from N terminus to C terminus.


they are associated with dihydroartemisinin—PPQ treatment failures 
that now occur, on average, in 50% of cases in the region and reach 87%
of cases in northeastern Thailand’. CQ and PPQ are thought to act by 
accumulating in the acidic digestive vacuole of the intra-erythrocytic 
parasite as protonated species (CQ²⁺ and PPQ⁴⁺). These drugs bind to
toxic Fe**-haem, which is released from proteolysed host haemoglobin, 
and inhibit haem incorporation into chemically inert haemozoin"”. 
Resistance to CQ has been attributed to mutant PfCRT-mediated drug 
efflux out of the digestive vacuole* " (Fig. 1a), whereas the mechanism 
of PPQ resistance has remained unclear. 

We used single-particle cryo-electron microscopy (cryo-EM) to 
determine the structure of the 49-kDa PfCRT 7G8 isoform, which was 
identified to be the most suitable candidate from a panel of 12 sequence
variants and/or orthologues expressed in HEK293 cells. To overcome 
current cryo-EM size limitations”, we screened a synthetic phage dis- 
play library’® to select for recombinant PfCRT-specific antigen-binding 
fragments (Fabs, around 50 kDa). PfCRT complexes were obtained with 
several candidates, including the Fab CTC that yielded high-quality
two-dimensional class averages obtained by negative-stain electron
microscopy (Extended Data Fig. 2). 

We solved the structure of the nanodisc-incorporated PfCRT 7G8-Fab CTC
complex to a nominal resolution of 3.3 Å, with the PfCRT portion
resolved to 3.2 Å (Extended Data Table 1 and Extended Data Fig. 3).
We built a de novo model of PfCRT comprising residues 47-113 and
123-405 of the 424-amino acid protein (Fig. 1b, c). PfCRT appears to be
monomeric and consists of 10 transmembrane (TM) helices (Fig. 1b).
Two juxtamembrane (JM) helices (JM1 and JM2), parallel to and on
opposite sides of the digestive vacuole membrane, are located at the
cytosolic N terminus and between TM7 and TM8, respectively (Fig. 1b, c
and Extended Data Fig. 4). The 10 TM helices are arranged as five
helical pairs that form two-helix hairpins with an inverted antiparallel
topology (Extended Data Fig. 5a). TM1-TM4 and TM6-TM9 form a
central cavity of around 3,300 Å³ that is wider on the digestive vacuole
side and closes approximately halfway into the membrane, suggesting
that the structure is in an open-to-digestive-vacuole/inward-open
conformation (Fig. 1b, d). The pseudo-symmetrical arrangement of the
TM helices also enabled us to model the alternative open-to-cytosol
conformation of TM1-TM10 (Extended Data Fig. 5b).

Structural conservation between PfCRT 7G8 and other members 
of the drug/metabolite transporter (DMT) superfamily with known 
structure is high in the TM helices, with a root mean square deviation 
of 2.8-4.0 Å between backbone atoms (Extended Data Fig. 5c, d). The
central cavity in the structurally conserved DMT proteins constitutes 
the known ligand-binding site (Extended Data Fig. 5d), indicating that 
this cavity is probably the equivalent site in PfCRT. The net charge in 
the PfCRT 7G8 cavity is negative (Fig. 1d and Extended Data Fig. 5d), 
influenced mainly by D137, D326 and D329 (Extended Data Fig. 6a), 
suggesting that the putative substrates of PfCRT 7G8 are probably posi- 
tively charged. Consistent with this proposal, the positively charged 
R111 residue in the variable loop region of the Fab CTC protrudes into 
the PfCRT cavity (Extended Data Fig. 6a). Interactions between Fab 
R111 and PfCRT D326 and D329 presumably help to stabilize the PfCRT 
7G8-Fab complex. We also identified density in a separate hydrophobic
cleft, delineated by JM1, TM1, TM9 and TM10, that is unique to
PfCRT and near the cytosolic side of the digestive vacuole membrane.
We tentatively assigned this to cholesteryl hemisuccinate, which was 
present during the purification (Extended Data Fig. 6b-d). 
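
As a rough illustration of the backbone root mean square deviation quoted above, the following sketch computes the RMSD between two coordinate sets that are assumed to be already superimposed and matched atom for atom. The function name and the toy coordinates are hypothetical; the structural alignment step that a real comparison requires is not shown.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between two equal-length lists of (x, y, z) points.

    Assumes the structures are already superimposed and that point i in
    coords_a corresponds to point i in coords_b. Units follow the inputs.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))


# Toy (hypothetical) backbone positions in angstroms: every atom is shifted by 3 along z.
toy_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
toy_b = [(0.0, 0.0, 3.0), (1.0, 0.0, 3.0), (2.0, 0.0, 3.0)]
toy_rmsd = rmsd(toy_a, toy_b)
print(f"RMSD = {toy_rmsd:.1f} A")
```
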

The PfCRT central cavity, with a maximum diameter of approximately
25 Å, is in principle able to accommodate CQ or PPQ (which have maximum
lengths of approximately 14 and 21 Å, respectively; Extended Data
Fig. 6e). All CQ-resistant (CQ-R) isoforms, irrespective of geographical
origin, share a common and essential K76T mutation™”, which our
structure places directly in the lining of the cavity (Fig. 2). Compared
with the canonical wild-type CQ-sensitive (CQ-S) 3D7 isoform, the CQ-R
7G8 isoform has four additional mutations (C72S, A220S, N326D and
I356L) that also line the cavity (Fig. 2 and Extended Data Table 2a).
Similarly, the CQ-R Dd2 isoform has six of its eight mutations (M74I,
N75E, K76T, A220S, N326S and I356T) located within the cavity, and
two additional mutations (Q271E and R371I) are located on the loops
at the cavity entrance on the digestive vacuole side (Fig. 2). Distinct
amino acid substitutions associated with PPQ-R also line the cavity,
including the C350R mutation that arose in the 7G8 isoform, and H97Y,
F145I and G353V, as well as the recently described mutations T93S and
I218F, which each emerged separately in the Dd2 isoform”* ° (Fig. 2 and
Extended Data Table 2a). M343L, which confers low-grade PPQ resistance
in the Dd2 isoform®, lies deeper into the central cavity, close to
the parasite cytosol (Fig. 2).
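
To make the substitution notation concrete, the sketch below parses labels of the form wild-type residue, position, mutant residue, and compares the CQ-R 7G8 and Dd2 haplotypes listed above, written here in standard one-letter notation. The helper names are hypothetical and the code is only an illustration of the bookkeeping, not part of the study's analysis.

```python
import re

# One-letter wild-type residue, residue number, one-letter mutant residue (e.g. K76T).
MUTATION_RE = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutation(label):
    """Split a label such as 'K76T' into (wild_type, position, mutant)."""
    match = MUTATION_RE.match(label)
    if match is None:
        raise ValueError(f"not a valid substitution label: {label!r}")
    wild_type, position, mutant = match.groups()
    return wild_type, int(position), mutant


def mutated_positions(labels):
    """Return the set of residue numbers touched by a list of substitutions."""
    return {parse_mutation(label)[1] for label in labels}


# Haplotypes relative to the wild-type 3D7 isoform, as listed in the text.
SEVEN_G8 = ["C72S", "K76T", "A220S", "N326D", "I356L"]
DD2 = ["M74I", "N75E", "K76T", "A220S", "Q271E", "N326S", "I356T", "R371I"]

shared_positions = sorted(mutated_positions(SEVEN_G8) & mutated_positions(DD2))
identical_substitutions = sorted(set(SEVEN_G8) & set(DD2))

print("Positions mutated in both isoforms:", shared_positions)
print("Identical substitutions:", identical_substitutions)
```
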

CQ resistance-associated mutations are confined mostly to a single
region that includes TM1, TM6, TM8 and TM, in contrast to the PPQ 
mutations that are positioned primarily in two distinct regions that 
involve TM2, TM3, TM6 and TM9 (Extended Data Fig. 6f). All PPQ-R 
point mutations arose individually in the CQ-R Dd2 or 7G8 isoforms, 
indicating that the interaction with the larger and more positively

Fig. 2 | Mapping of drug-resistance mutations onto the PfCRT structure.
Residues known to contribute to resistance to CQ and PPQ (Extended Data
Fig. 1 and Extended Data Table 2) in the 7G8 and Dd2 isoforms are mapped onto
the 7G8 structure. Mutations have their side chains rendered as sticks and are 
coloured based on their associated resistance profiles. The remaining 
structures are rendered in cartoon and coloured in grey. Views are shown 
vertically (digestive vacuole to the bottom) and from the digestive vacuole 
side. 


charged PPQ⁴⁺ requires additional modifications to the central cavity.
Potentially, the physical separation between the sets of mutated
helices in the PPQ-R isoforms might correlate with the double 4-ami- 
noquinoline ring structure of PPQ, in contrast to the pattern observed 
for CQ, which contains a single 4-aminoquinoline ring (Extended Data 
Fig. 6e, f). No mutations in PfCRT associated with resistance to either 
drug mapped to residues that were fully conserved across our set of 
apicomplexan CRT orthologues (Extended Data Figs. 6f, 7 and Extended 
Data Table 2a), suggesting that there are structural and/or functional 
constraints. Resistance-conferring mutations map to moderately 
conserved sites in select helices in residues that line the cavity, thus 
providing a road map of sites for future PPQ resistance surveillance. 
For example, residues with these properties on TM2 are at positions 
90, 93, 94, 97, 98 and 101, among which T93S, H97Y and C101F have 
each been associated with PPQ resistance® *”. By contrast, mutations 
previously found in Southeast Asia that have not recently expanded in 
areas of PPQ treatment include L196P, N295I, A366T and G367C, which
do not meet the above criteria®’. 

To translate structural observations into transporter function, we
first used purified protein in scintillation proximity-based assays to
measure drug binding”’. Results with PfCRT 7G8 reconstituted in nanodiscs
showed binding to [³H]CQ and [³H]PPQ at comparable levels
(Fig. 3a). Isotopic dilution assays yielded mean half-maximal binding
constants (EC50) of approximately 300 nM for CQ, similar to previous
studies with Dd2 PfCRT”, and around 190 nM for PPQ (Fig. 3b). Each
drug competitively inhibited binding of the other with nearly identical
mean half-maximal inhibition constants (IC50 values) of approximately
170 nM (Fig. 3c).
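
The half-maximal constants are reported on a log scale in the Fig. 3 legend and quoted here in nanomolar. A minimal sketch of that conversion follows, assuming the logs are base 10 of a molar concentration; the function name is hypothetical. Because the published log values are rounded to two decimals, the back-converted numbers differ slightly from the means quoted in the text.

```python
def log_molar_to_nanomolar(log10_molar):
    """Convert log10 of a molar concentration to a concentration in nM."""
    return 10.0 ** log10_molar * 1e9


# Rounded log values as given in the Fig. 3 legend.
reported_logs = [
    ("CQ log(EC50)", -6.53),
    ("first log(IC50)", -6.77),
    ("second log(IC50)", -6.78),
]

for label, log_value in reported_logs:
    print(f"{label} = {log_value}: about {log_molar_to_nanomolar(log_value):.0f} nM")
```
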

Binding of these two drugs was partially inhibited by verapamil
(Fig. 3d), an agent that is known to reverse CQ resistance”. Near-complete
inhibition was observed with amodiaquine, a related 4-aminoquinoline
first-line partner drug against which PfCRT 7G8 mediates
cross-resistance”. Control assays found no inhibition of [³H]CQ or
[³H]PPQ binding with the chemically distinct antimalarial drugs
lumefantrine and atovaquone, which have separate modes of action’.


We also observed inhibition of [³H]CQ and [³H]PPQ binding with
excess arginine, consistent with the observed interaction between
R111 of the Fab CTC and the PfCRT cavity (Extended Data Fig. 6a), and
previous [³H]Arg transport studies™. Control experiments revealed no
inhibition of drug binding with the neutral amino acid leucine (Fig. 3d).
Results of the scintillation proximity-based assays showed binding of
[³H]Arg to PfCRT 7G8 with an EC50 of approximately 400 µM (Extended
Data Fig. 8a, b). This binding was effectively competed by both CQ and
PPQ, as well as by amodiaquine and non-radiolabelled arginine or lysine,
but not by leucine (Extended Data Fig. 8c). Binding of [³H]Arg, [³H]CQ
and [³H]PPQ was inhibited by the Fab CTC in a concentration-dependent
manner (Fig. 3e), suggesting that Arg, CQ, PPQ and the Fab overlap in
their binding in the PfCRT cavity.

We next showed that active uptake of [³H]CQ and [³H]Arg required an
inwardly directed pH gradient (pH 5.5-7.5, corresponding to the digestive
vacuole pH gradient of the parasite”), and a membrane potential
ΔΨ (Extended Data Fig. 8d, e). Here, ΔΨ was created by using the
K⁺-specific ionophore valinomycin to generate an outward-directed K⁺
diffusion gradient from the proteoliposomes. Under these conditions,
uptake of [³H]CQ and [³H]Arg was maximal within 1 min (Fig. 3f). Kinetic
characterization of the concentration-dependent uptake of [³H]CQ
and [³H]Arg yielded Michaelis-Menten constants (Km) of 0.8 µM and
1.3 µM and maximum velocities of transport (Vmax) of approximately
90 nmol mg⁻¹ min⁻¹ and 190 nmol mg⁻¹ min⁻¹, respectively (Fig. 3f).
Assuming full activity of PfCRT 7G8 reconstituted in proteoliposomes,
these Vmax values reflect catalytic turnover numbers (kcat) of around
0.2 s⁻¹ and about 0.4 s⁻¹, respectively; values that are characteristic
for secondary active transporters rather than for channels”. Collectively,
these results indicate that proton-motive force-dependent CQ
efflux is the mechanism that underlies mutant PfCRT 7G8-mediated
resistance to CQ, in agreement with previous reports*” >”*. Under our
experimental conditions, no detectable [³H]PPQ uptake was observed
with PfCRT 7G8 in proteoliposomes, as expected for this PPQ-sensitive
(PPQ-S) isoform (Fig. 3f). We also observed partial-to-strong inhibition
of [³H]CQ and [³H]Arg uptake by verapamil and 4-aminoquinolines,
further implicating a crucial role for the central cavity in substrate
recognition (Fig. 3g).
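
For readers less familiar with the kinetic constants above, the following sketch evaluates the Michaelis-Menten rate law, v = Vmax [S] / (Km + [S]), using the approximate constants quoted for CQ and Arg uptake. It is only an illustration of the equation: the function name and the substrate concentrations chosen for the printout are assumptions, and no fitting of the actual data is attempted.

```python
def michaelis_menten_rate(substrate_um, vmax, km_um):
    """Initial transport rate from the Michaelis-Menten equation.

    substrate_um and km_um are in micromolar; the result has the units of vmax
    (here nmol per mg protein per min).
    """
    if substrate_um < 0 or km_um <= 0:
        raise ValueError("substrate must be non-negative and Km positive")
    return vmax * substrate_um / (km_um + substrate_um)


# Approximate constants quoted in the text (Km in micromolar, Vmax in nmol/mg/min).
CONSTANTS = {
    "CQ": {"km_um": 0.8, "vmax": 90.0},
    "Arg": {"km_um": 1.3, "vmax": 190.0},
}

# Illustrative substrate concentrations (hypothetical choice within 0.05-10 micromolar).
for substrate_um in (0.05, 0.8, 10.0):
    for name, params in CONSTANTS.items():
        rate = michaelis_menten_rate(substrate_um, params["vmax"], params["km_um"])
        print(f"{name} at {substrate_um} uM: {rate:.1f} nmol/mg/min")
```

At a substrate concentration equal to Km, the rate is half of Vmax, which is the defining property of the constant.
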

Our structural analysis reveals four cysteines in the long loop 
between TM7 and TM8, located on the digestive vacuole side near the 
entrance to the cavity. These cysteines, which appear to form the disul- 
phide bridges Cys289-Cys312 and Cys301-Cys309 (located next to 
JM2 and at the tip of the loop, respectively), are completely conserved
(Extended Data Figs. 7, 9a), suggesting that these cysteines might have 
a potential redox-dependent mechanistic role. Of note, P. falciparum 
intra-erythrocytic parasites are subjected to oxidative stress during 
parasite-mediated haemoglobin proteolysis and Fe”*-haem oxidation®. 
We expressed and purified 7G8 variants that expressed either a C301A
or C289A mutation. We observed equivalent [³H]CQ binding for these
variants and 7G8, which was unaffected by increasing concentrations of
the reducing agent β-mercaptoethanol (Extended Data Fig. 9b). [³H]CQ
uptake was also similar between these three isoforms in the absence of
a reducing agent. Increasing the concentration of β-mercaptoethanol
led to reduced uptake of CQ by all three tested isoforms (Extended Data 
Fig. 9c). The reduced dose dependency of the 7G8 + C301A isoform 
in respect to 7G8 and 7G8 + C289A indicates that the Cys289-Cys312 
pair is less sensitive to reducing conditions that appear to regulate 
CQ transport. 

To relate PfCRT structure and function to resistance mechanisms, 
we leveraged mutations in Dd2 and 7G8 isolates that were implicated 
in differential CQ and PPQ susceptibilities. We chose the F145I mutation,
which mediates high-grade resistance to PPQ in Dd2 parasites and
which is associated with a substantially increased risk of failure of the 
dihydroartemisinin-PPQ treatment in Southeast Asia”*’””. We hypoth- 
esized that this mutation might also lead to PPQ resistance in the 7G8 
(South American and Western Pacific) background. We also selected

Fig. 3 | Functional characterization of PfCRT isoforms. a, Binding of 370 nM
[³H]CQ (red) or 75 nM [³H]PPQ (blue) to PfCRT 7G8 with or without imidazole.
cpm, counts per minute. Data are mean ± s.e.m. of n = 3 independent
experiments; grey symbols are the data mean of technical triplicates of each
independent experiment to show the data distribution. b, Isotopic dilution of
[³H]CQ and [³H]PPQ revealed log(EC50) = −6.53 ± 0.04 and
log(EC50) = −6.70 ± 0.05 (corresponding to means of 297 nM for CQ and 190 nM
for PPQ), respectively. Data are mean ± s.e.m. of n = 3 independent experiments
performed as technical triplicates. c, Competition of [³H]CQ and [³H]PPQ
binding with non-radiolabelled PPQ and CQ, respectively, revealed
log(IC50) = −6.77 ± 0.03 and log(IC50) = −6.78 ± 0.06 (corresponding to means of
171 nM and 167 nM), respectively. Data are mean ± s.e.m. of n = 3 independent
experiments performed as technical triplicates. d, Specific binding of [³H]CQ
or [³H]PPQ performed in the absence (-) or presence of 10 µM verapamil (VP),
1 µM amodiaquine (ADQ), 10 µM lumefantrine (LMF) or atovaquone (ATQ), or
1 mM Arg or Leu. Nanodiscs containing LeuT” served as controls. Data are
mean ± s.e.m. of n = 3 independent experiments; grey symbols show the data
mean of technical triplicates of each independent experiment. e, Fab CTC
binding to the PfCRT 7G8 isoform reduces binding of 370 nM [³H]CQ, 75 nM
[³H]PPQ or 250 nM [³H]Arg in a concentration-dependent manner, yielding IC50
values of 0.15 ± 0.02 µM, 0.20 ± 0.02 µM or 0.18 ± 0.02 µM, respectively. Data
are mean ± s.e.m. of n = 3 independent experiments of technical triplicates.
f, Time course of 93 nM [³H]CQ, 75 nM [³H]PPQ, or 125 nM [³H]Arg uptake
measured with PfCRT 7G8-containing proteoliposomes. Data are mean ± s.e.m.
of n = 2 independent experiments performed as technical triplicates. Inset,
transport kinetics of PfCRT 7G8 for [³H]CQ and [³H]Arg in proteoliposomes
revealed Michaelis-Menten constants (Km) of 0.77 ± 0.15 µM and 1.30 ± 0.20 µM
with a maximum velocity of transport (Vmax) of 92.2 ± 4.9 nmol mg⁻¹ min⁻¹ and
189.2 ± 9.1 nmol mg⁻¹ min⁻¹ for the uptake of CQ and Arg, respectively. Data show
means of n = 2 independent experiments of technical replicates (n = 4) and the
kinetic constants represent the mean ± s.e.m. of the global fits. g, Uptake of
370 nM [³H]CQ or 250 nM [³H]Arg was measured for 1 min in the presence or
absence of 1 µM Fab CTC, 10 µM verapamil, CQ, PPQ or amodiaquine, or 1 mM
Leu. Values were normalized to the signal in the absence of the
non-radiolabelled compound. Data are mean ± s.e.m. of n = 3 independent
experiments; grey symbols show the data mean of technical triplicates of each
independent experiment. a-e, Data are normalized to the specific signal (total
counts per minute minus counts per minute in the presence of imidazole) in the
absence of the respective non-radiolabelled compound. h, Saturation binding
of [³H]CQ (red) or [³H]PPQ (blue) by indicated PfCRT variants reconstituted in
nanodiscs was performed using equilibrium dialysis at pH 5.5. Dissociation
constants (Kd) are shown as mean ± s.e.m. calculated from global nonlinear
regression fitting (n = 3 independent experiments). i, Uptake of 100 nM [³H]CQ
(red) or [³H]PPQ (blue) was measured for 1 min in proteoliposomes containing
the indicated PfCRT variants or in control liposomes. Data are means of n = 2
independent experiments; grey symbols show the data mean of technical
replicates (n = 4) of each independent experiment. j, Kinetic characterization
of [³H]CQ (red) or [³H]PPQ (blue) uptake in proteoliposomes containing the
indicated PfCRT variants. The initial rate of transport was measured for
periods of 3 s using [³H]CQ or [³H]PPQ concentrations ranging from 0.05 to
10 µM. Data (means of n = 2 independent experiments of technical replicates
(n = 4)) were fitted to the Michaelis-Menten equation. The Km and Vmax values
are shown as mean ± s.e.m. of the fit in Extended Data Table 2b. Transport
kinetics of PfCRT 7G8, PfCRT Dd2 or wild-type PfCRT for PPQ or wild-type
PfCRT for CQ were not determined due to low signal-to-noise ratios. k, [³H]CQ
and [³H]PPQ CARs at t = 60 min for edited and reference parasite lines. Data are
mean ± s.e.m. of n = 4-6 independent experiments performed in duplicate
(n = 2). *P < 0.05; **P < 0.01 (using two-tailed Mann-Whitney U-tests).
l, Percentage parasite survival of pfcrt-edited and reference lines following
exposure to PPQ for 48 h. Survival was calculated for each line by dividing the
parasitaemia of the PPQ-treated parasites with that of the no-drug control.
WT, wild type. Data are mean ± s.e.m. for n = 2-5 independent assays performed
in duplicate (n = 2). Statistical significance was determined by two-tailed
Mann-Whitney U-tests for comparisons in which n > 2.
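
The normalization described for panels a-e of this legend can be written out as a short calculation: the specific signal is the total counts per minute minus the counts per minute measured in the presence of imidazole, and each condition is then expressed relative to the specific signal without competitor. The sketch below assumes that reading; the function names and the example counts are hypothetical.

```python
def specific_signal(total_cpm, imidazole_cpm):
    """Specific binding signal: total cpm minus cpm measured with imidazole."""
    return total_cpm - imidazole_cpm


def normalized_binding(total_cpm, imidazole_cpm, ref_total_cpm, ref_imidazole_cpm):
    """Specific signal of a condition relative to the no-competitor reference."""
    reference = specific_signal(ref_total_cpm, ref_imidazole_cpm)
    if reference <= 0:
        raise ValueError("reference specific signal must be positive")
    return specific_signal(total_cpm, imidazole_cpm) / reference


# Hypothetical counts: a competitor lowers total cpm from 1,100 to 600,
# with 100 cpm of background measured in the presence of imidazole.
fraction_bound = normalized_binding(600.0, 100.0, 1100.0, 100.0)
print(f"Binding relative to no-competitor control: {fraction_bound:.2f}")
```
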


the C350R mutation, which is found in 7G8 parasites and which has 
previously been proposed to have arisen from local PPQ drug pressure’. 

First, we expressed the recombinant PfCRT variants 7G8, 7G8 + F145I,
7G8 + C350R and Dd2, as well as the canonical CQ-S and PPQ-S wild-type
isoform (from 3D7 parasites). These purified proteins were reconstituted
in nanodiscs and tested in equilibrium dialysis experiments for
[³H]CQ and [³H]PPQ binding. Results showed a comparable dissociation
constant (Kd) across all variants for both drugs at pH 7.5 (Extended
Data Table 2b). However, at pH 5.5, which reflects the pH of the acidic
digestive vacuole”, we observed an approximately twofold increase in
the PPQ⁴⁺ Kd (that is, an approximately twofold lower binding affinity)
for the 7G8 + F145I and 7G8 + C350R variants compared with the 7G8
parental isoform (Fig. 3h and Extended Data Table 2b). Testing PPQ
uptake in PfCRT proteoliposomes revealed substantially increased PPQ
transport rates by the 7G8 + F145I and 7G8 + C350R variants (Fig. 3i, j
and Extended Data Table 2b). Minimal PPQ transport was observed
with the PPQ-S isoforms 7G8, Dd2 and wild-type PfCRT. Whereas the
low transport activity of the 7G8 and Dd2 parental isoforms correlated
with the higher PPQ⁴⁺ binding affinity at pH 5.5 (Fig. 3h), this relationship
was not observed for wild-type PfCRT (which exhibits a low PPQ⁴⁺
binding affinity and minimal PPQ transport activity). For CQ, results
showed robust transport activity with the CQ-R 7G8 and Dd2 parental
isoforms, reduced uptake by the 7G8 + F145I and 7G8 + C350R variants,
and minimal uptake by wild-type PfCRT (Fig. 3i). For the 7G8 + C350R
variant, the higher CQ²⁺ binding affinity (Fig. 3h) paralleled reduced
CQ transport (Fig. 3i). However, for the wild-type protein the low CQ
transport activity could not be attributed to tight CQ²⁺ binding that
would potentially impede the effective dissociation of the drug from
the transporter. Instead we propose that the presence of positive K76 in
wild-type PfCRT inhibits the required drug-PfCRT interactions within
the cavity to enable transport. In summary, these biochemical data
provide evidence that both the F145I and C350R mutations can mediate
PPQ efflux when introduced into the 7G8 isoform, concomitant
with reduced CQ transport. These data also suggest that drug-specific
transport, differentially mediated by PfCRT mutations that occur at distinct
sites in the central cavity, is the major determinant of resistance.

To correlate these findings to drug susceptibility in the parasite, 
we genetically edited pfcrt in the P. falciparum 7G8 line to express the
7G8 + F145I or 7G8 + C350R variants and compared these to control 
isogenic 7G8 parasites that express the 7G8 isoform. Their characteri- 
zation revealed a significant increase in the intracellular accumulation 
ratio (CAR) for CQ after 1h in 7G8 + F145I and 7G8 + C350R clones, 
compared to the CQ-R 7G8 and Dd2 lines that showed a minimal CAR 
(Fig. 3k and Extended Data Table 3). Wild-type (3D7) CQ-S parasites 
showed high levels of CQ accumulation. For PPQ, the F145I and C350R 
variants showed a decrease in the CAR compared to the 7G8 parent, 
although the difference was smaller than for CQ. PPQ accumulation 
levels were uniformly high for the PPQ-S 7G8, Dd2 and 3D7 parasites. 
These CAR data are consistent with 72-h parasite growth-inhibition 
assays that showed reversal of CQ resistance upon introduction of the 
F145I and C350R mutations into 7G8 parasites, mirrored by increased
PPQ IC50 and IC90 values in these variants (Extended Data Table 3). The
7G8 + F145I mutant showed a pronounced biphasic dose-response 
curve, with IC,, values showing a 188-fold increase compared to the 
isogenic 7G8 control line. A similar fold increase in PPQ IC, values was 
observed in parallel studies that compared the Dd2 + F145I mutant with 
isogenic Dd2 parasites (Extended Data Fig. 9d). These findings provide 
evidence that the F145I mutation affords comparable high-grade PPQ 
resistance at elevated PPQ concentrations in these two genetically and 
geographically distinct backgrounds® (Extended Data Fig. 9d). 

As stated above, PPQ resistance in parasites is evident only at high 
nanomolar PPQ concentrations®”’. Accordingly, PPQ resistance is best 
examined using PPQ survival assays, in which synchronized ring-stage 
parasites (0-6 h after invasion of erythrocytes) are exposed to high con- 
centrations of PPQ for 48 h and the percentage survival of parasites is 


measured 24 h later as a percentage of surviving drug-treated parasites 
compared to mock-treated parasites”. Results from this assay revealed 
a survival of 9-10% at the standard assay concentration of 200 nM PPQ,
and similar levels of survival were maintained at PPQ concentrations 
of up to 1,600 nM in 7G8 + F145I or 7G8 + C350R parasites (Fig. 3l and
Extended Data Table 3). PPQ-S parasites showed 1% survival or less at 
concentrations of 200 nM or higher. These results are in line with our 
biochemical studies, which showed increased PPQ transport, accom- 
panied by diminished CQ transport, after the introduction of the Asian 
F145I and South American C350R mutations into the 7G8 isoform. These
findings provide compelling evidence that these PfCRT mutations 
could affect PPQ efficacy in South America. 
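
The survival readout used in these assays is a simple ratio: the parasitaemia of PPQ-treated parasites divided by that of the mock-treated control, expressed as a percentage. A minimal sketch follows; the function name and the parasitaemia values are hypothetical and were chosen only so that the result falls in the 9-10% range reported for the edited lines.

```python
def percent_survival(treated_parasitaemia, control_parasitaemia):
    """Percentage survival: drug-treated parasitaemia relative to the no-drug control."""
    if control_parasitaemia <= 0:
        raise ValueError("control parasitaemia must be positive")
    return 100.0 * treated_parasitaemia / control_parasitaemia


# Hypothetical parasitaemia readings (percentage of infected erythrocytes).
example_survival = percent_survival(treated_parasitaemia=0.5, control_parasitaemia=5.0)
print(f"Survival after PPQ exposure: {example_survival:.0f}%")
```
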

To link our structural and functional findings, we calculated the electrostatic
potential surfaces of the central cavity for the PfCRT isoforms
described above. 7G8 and Dd2 exhibit the most electronegativity in the
central cavity, consistent with their highest affinity for PPQ at pH 5.5
(Fig. 3h and Extended Data Fig. 10a). Assuming PPQ⁴⁺ formation at
pH 5.5, these variants would feature the strongest electrostatic interaction
of PPQ⁴⁺ in the PfCRT cavity. Introducing an arginine into this cavity
(7G8 + C350R) caused a reduction in its electronegativity (Extended
Data Fig. 10b), which could explain the reduced binding affinity for PPQ
and the resulting transport-positive phenotype of this variant (Fig. 3h-j).
In the case of PfCRT 7G8 + F145I, however, the electrostatic charge in
the central cavity appeared to be unchanged when calculated on the
7G8 model. Molecular dynamics simulations suggested that the F145I
mutation induced a conformational change in PfCRT, which involved
substantial displacement of TM1, with 145I (TM3) showing greater
atomic distance from L83 (TM1) compared to F145 in 7G8 (Extended
Data Fig. 10c). TM7 was also notably displaced towards the 145I variant,
as shown by the distance to V263. By contrast, marginal movement was
observed with the C350R mutation (Extended Data Fig. 10d). These
simulations suggest that changes in the cavity shape and size, which
result from the F145I mutation, could affect molecular interactions that
are essential for the differential transport of CQ and PPQ.

Our analyses highlight a complex mechanism of resistance that is 
dependent on the gain of CQ” or PPQ* efflux through specific muta- 
tions inthe central cavity of PfCRT, which evolved under regional selec- 
tive drug pressures that began decades ago with the former first-line 
drug CQ and, more recently, involves PPQ. These mutation-specific 
changes can differentially influence the electrostatic and other phys- 
icochemical properties of the cavity, affecting drug binding and/or the 
structural dynamics that are associated with transport®°. The apparent 
dichotomy observed between CQ and PPQ for the F145I and C350R 
mutants, which evolved in CQ-R isoforms and caused CQ resensitization
along with a gain of PPQ resistance, highlights the value of extending 
this research to rapidly emerging PfCRT mutations”’®, as a means to 
identify antimalarial drug combinations that could exert opposing 
selective pressures on this transporter. 


Online content 


Any methods, additional references, Nature Research reporting sum- 
maries, source data, extended data, supplementary information, 
acknowledgements, peer review information; details of author con- 
tributions and competing interests; and statements of data and code 
availability are available at https://doi.org/10.1038/s41586-019-1795-x. 


1. Su, X. Z., Lane, K. D., Xia, L., Sa, J. M. & Wellems, T. E. Plasmodium genomics and genetics: 
new insights into malaria pathogenesis, drug resistance, epidemiology, and evolution. 
Clin. Microbiol. Rev. 32, e00019 (2019).

2. van der Pluijm, R. W. et al. Determinants of dihydroartemisinin-piperaquine treatment 
failure in Plasmodium falciparum malaria in Cambodia, Thailand, and Vietnam: a 
prospective clinical, pharmacological, and genetic study. Lancet Infect. Dis. 19, 952-961 
(2019). 

3. Fidock, D. A. et al. Mutations in the P. falciparum digestive vacuole transmembrane 
protein PfCRT and evidence for their role in chloroquine resistance. Mol. Cell 6, 861-871 
(2000). 


Nature | Vol576 | 12 December 2019 | 319 


4. Martin, R. E. et al. Chloroquine transport via the malaria parasite’s chloroquine resistance
transporter. Science 325, 1680-1682 (2009).

5. Blasco, B., Leroy, D. & Fidock, D. A. Antimalarial drug resistance: linking Plasmodium
falciparum parasite biology to the clinic. Nat. Med. 23, 917-928 (2017).

6. Ross, L. S. et al. Emerging Southeast Asian PfCRT mutations confer Plasmodium
falciparum resistance to the first-line antimalarial piperaquine. Nat. Commun. 9, 3314 (2018).

7. Hamilton, W. L. et al. Evolution and expansion of multidrug-resistant malaria in Southeast
Asia: a genomic epidemiology study. Lancet Infect. Dis. 19, 943-951 (2019).

8. Dhingra, S. K., Small-Saunders, J. L., Ménard, D. & Fidock, D. A. Plasmodium falciparum
resistance to piperaquine driven by PfCRT. Lancet Infect. Dis. 19, 1168-1169 (2019).

9. Pelleau, S. et al. Adaptive evolution of malaria parasites in French Guiana: reversal of
chloroquine resistance by acquisition of a mutation in pfcrt. Proc. Natl Acad. Sci. USA 112,
11672-11677 (2015).

10. World Health Organization. World Malaria Report 2018. https://www.who.int/malaria/
publications/world-malaria-report-2018/en (2018).

11. Sullivan, D. J. Jr. Quinolines block every step of malaria heme crystal growth. Proc. Natl
Acad. Sci. USA 114, 7483-7485 (2017).

12. Dhingra, S. K. et al. A variant PfCRT isoform can contribute to Plasmodium falciparum
resistance to the first-line partner drug piperaquine. mBio 8, e00303-17 (2017).

13. Lakshmanan, V. et al. A critical role for PfCRT K76T in Plasmodium falciparum verapamil-
reversible chloroquine resistance. EMBO J. 24, 2294-2305 (2005).

14. Sanchez, C. P. et al. Differences in trans-stimulated chloroquine efflux kinetics are linked
to PfCRT in Plasmodium falciparum. Mol. Microbiol. 64, 407-420 (2007).

15. Paguio, M. F., Cabrera, M. & Roepe, P. D. Chloroquine transport in Plasmodium falciparum.
2. Analysis of PfCRT-mediated drug transport using proteoliposomes and a fluorescent
chloroquine probe. Biochemistry 48, 9482-9491 (2009).

16. Sanchez, C. P. et al. Phosphomimetic substitution at Ser-33 of the chloroquine resistance
transporter PfCRT reconstitutes drug responses in Plasmodium falciparum. J. Biol. Chem.
294, 12766-12778 (2019).

17. Renaud, J. P. et al. Cryo-EM in drug discovery: achievements, limitations and prospects.
Nat. Rev. Drug Discov. 17, 471-492 (2018).

18. Dominik, P. K. et al. Conformational chaperones for structural studies of membrane
proteins using antibody phage display with nanodiscs. Structure 24, 300-309 (2016).

19. Callaghan, P. S., Hassett, M. R. & Roepe, P. D. Functional comparison of 45 naturally
occurring isoforms of the Plasmodium falciparum chloroquine resistance transporter
(PfCRT). Biochemistry 54, 5083-5094 (2015).


20. Quick, M. & Javitch, J. A. Monitoring the function of membrane transport proteins in
detergent-solubilized form. Proc. Natl Acad. Sci. USA 104, 3603-3608 (2007).

21. Lekostaj, J. K., Natarajan, J. K., Paguio, M. F., Wolf, C. & Roepe, P. D. Photoaffinity labeling
of the Plasmodium falciparum chloroquine resistance transporter with a novel
perfluorophenylazido chloroquine. Biochemistry 47, 10394-10406 (2008).

22. Bellanca, S. et al. Multiple drugs compete for transport via the Plasmodium falciparum
chloroquine resistance transporter at distinct but interdependent sites. J. Biol. Chem.
289, 36336-36351 (2014).

23. Sa, J. M. et al. Geographic patterns of Plasmodium falciparum drug resistance
distinguished by differential responses to amodiaquine and chloroquine. Proc. Natl Acad.
Sci. USA 106, 18883-18889 (2009).

24. Juge, N. et al. Plasmodium falciparum chloroquine resistance transporter is a
H⁺-coupled polyspecific nutrient and drug exporter. Proc. Natl Acad. Sci. USA 112,
3356-3361 (2015).

25. Kuhn, Y., Rohrbach, P. & Lanzer, M. Quantitative pH measurements in Plasmodium
falciparum-infected erythrocytes using pHluorin. Cell. Microbiol. 9, 1004-1013
(2007).

26. Ashcroft, F., Gadsby, D. & Miller, C. Introduction. The blurred boundary between channels
and transporters. Phil. Trans. R. Soc. Lond. B 364, 145-147 (2009).

27. Agrawal, S. et al. Association of a novel mutation in the Plasmodium falciparum
chloroquine resistance transporter with decreased piperaquine sensitivity. J. Infect. Dis.
216, 468-476 (2017).

28. Bopp, S. et al. Plasmepsin II–III copy number accounts for bimodal piperaquine resistance
among Cambodian Plasmodium falciparum. Nat. Commun. 9, 1769 (2018).

29. Duru, V. et al. Plasmodium falciparum dihydroartemisinin-piperaquine failures in
Cambodia are associated with mutant K13 parasites presenting high survival rates in
novel piperaquine in vitro assays: retrospective and prospective investigations. BMC
Med. 13, 305 (2015).

30. Shi, L., Quick, M., Zhao, Y., Weinstein, H. & Javitch, J. A. The mechanism of a
neurotransmitter:sodium symporter—inward release of Na⁺ and substrate is triggered by
substrate in a second binding site. Mol. Cell 30, 667-677 (2008).


Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in 
published maps and institutional affiliations. 


© The Author(s), under exclusive licence to Springer Nature Limited 2019 


Methods 


Small-scale expression screen of CRT variants 

Initial expression studies were performed with twelve CRT vari- 
ants, including the P. falciparum crt sequences 7G8 (AF233064), 3D7 
(Q9N623; codon-optimized for Xenopus laevis), Dd2 (D5L5S2; codon- 
optimized for X. laevis), Plasmodium berghei CRT (PbCRT; Q9GSD8), 
Plasmodium knowlesi CRT (PkCRT; Q9GSD7), Plasmodium vivax CRT
(PvCRT; Q9GSD3, codon-optimized for Spodoptera frugiperda), Thei- 
leria parva CRT (TpCRT; Q4N5R6), Theileria annulata CRT (TaCRT; 
Q4UDS9) and Cryptosporidium parvum CRT (CpCRT; Q7YZ23). To 
express these in mammalian cells, the sequences were cloned into 
pFM1.2*! as a GFP fusion with a decahistidine affinity tag at either the 
5’ or 3’ end of the gene. In brief, 1 μg of each construct was diluted into
100 μl Opti-MEM (Thermo Fisher Scientific) and mixed with 100 μl Opti-MEM
containing 4 μg of polyethylenimine (PEI) maximum molecular
mass of 40,000 Da (Polysciences). This mixture was added to 1.0 × 10⁶
HEK293T cells (Invitrogen) in a total volume of 2 ml in a 6-well plate 
(Corning). The cell transfection mixture was incubated at 37 °C for 
72 h in 5% CO₂. Transfected cells were then collected, centrifuged at
800g for 10 min at 4 °C and washed once on ice in 1x PBS. Each pellet 
was resuspended and solubilized in 20 mM HEPES pH 7.5, 200 mM 
NaCl, 20 mM MgSO₄, 0.5 mM phenylmethylsulfonyl fluoride (PMSF),
cOmplete EDTA-free protease inhibitor cocktail (Roche), 10 μg ml⁻¹
DNase (Roche) and 8 μg ml⁻¹ RNase (Sigma-Aldrich) supplemented with
1% (w/v) n-dodecyl-β-D-maltopyranoside (DDM) and 0.1% (w/v) cholesteryl
hemisuccinate (CHS) at 4 °C for 2 h. Insoluble material was
removed by ultracentrifugation in a single-angle rotor at 4 °C for 45 min.
The supernatant of each sample was subjected to western blot analysis 
using a rabbit anti-GFP antibody (diluted 1:5,000; Invitrogen) and a
secondary goat anti-rabbit IgG conjugated to horseradish peroxidase 
(HRP) (diluted 1:7,000; Invitrogen). Labelling was detected using a 
western blot Luminol reagent (Santa Cruz). 

Expression and stability were further evaluated by fluorescence- 
coupled size-exclusion chromatography (FSEC)”. In these assays, 
each GFP-tagged construct was transfected into HEK293 Freestyle 
cells (Invitrogen) in the presence of PEI. Transfected cells were incubated
at 37 °C for 72 h with 8% CO₂ and 70% humidity. Cell supernatants
were prepared for FSEC analysis as described above. DDM-solubilized 
supernatants were loaded onto a TSKgel G4000SWxl column (Tosoh
Bioscience) attached to a Prominence UFLC (Shimadzu) coupled to a
RF-10AXL fluorescence detector (Shimadzu). These studies identified 
the PfCRT 7G8 construct as having the best yield and mono-dispersity. 


PfCRT 7G8 protein expression, purification and reconstitution 
into nanodiscs 

The pfcrt7G8 full-length open-reading frame was cloned into the pEG 
BacMam vector” using the Gibson Assembly method”. pfcrt was fused 
at its 3’ end with a tobacco etch virus protease cleavage site (ENLYFQSYV)
and a decahistidine affinity tag followed by a streptavidin affinity tag
(WSHPQFEK). The resulting plasmid was transformed into Escherichia
coli DH10Bac competent cells to generate the bacmid using a bac-to-bac
(Invitrogen) protocol. Recombinant P1 baculovirus was transfected 
into Sf9 cells (Expression Systems) in the presence of PEI and cultured
in ESF 921 protein-free insect cell culture medium (Expression Systems).
For protein expression, 100 ml P4 virus was used to infect 1-l
cultures of HEK293S GnTI⁻ cells (Invitrogen) at 3 × 10⁶ cells per ml in
Freestyle 293 medium (GIBCO) supplemented with 2% FBS (GIBCO). 
After infection, the cells were incubated at 37 °C for 8 to 10 h in the
presence of 8% CO₂ and 70% humidity. We then added 10 mM sodium
butyrate (Sigma-Aldrich) to enhance BacMam protein expression. Cells 
were further incubated at 37 °C for 48 h before collection. Cell pellets 
were homogenized in low-salt buffer (10 mM HEPES pH 7.5, 10 mM KCl,
10 mM MgCl₂, 0.5 mM PMSF, cOmplete EDTA-free protease inhibitor
cocktail, 10 μg ml⁻¹ DNase I and 8 μg ml⁻¹ RNase) in a glass homogenizer.


Membrane fractions were isolated by ultracentrifugation at 40,000g in
a type 45 Ti rotor (Beckman Coulter). Membrane fractions were further
homogenized and washed twice with high-salt buffer (10 mM HEPES
pH 7.5, 10 mM KCl, 10 mM MgCl₂, 1 M NaCl, 0.5 mM PMSF, cOmplete
protease inhibitor cocktail, 10 μg ml⁻¹ DNase I and 8 μg ml⁻¹ RNase) in a
glass homogenizer followed by ultracentrifugation. The washed mem- 
brane fractions were resuspended again by homogenizing in buffer 
comprising 20 mM HEPES pH 7.5, 200 mM NaCl and 0.5 mM PMSF and 
cOmplete protease inhibitor cocktail, and stored at —80 °C until use. 

The thawed membrane fraction was solubilized by adding DDM with 
CHS in a 10:1 (w/w) ratio to a final concentration of 1% (w/v) detergent,
as previously described®, and incubated at 4 °C for 2h with gentle 
agitation. Insoluble material was removed by ultracentrifugation at 
40,000g in a type 45 Ti rotor (Beckman Coulter) at 4 °C for 30 min.
The supernatant was filtered (0.22-μm filters, Millipore), placed in a
Falcon tube containing pre-equilibrated Ni²⁺-NTA resin (Qiagen) in
the presence of 20 mM imidazole, and incubated at 4 °C overnight 
with gentle rotation. The resin was washed with 10 column volumes of 
buffer comprising 20 mM HEPES pH 7.5, 200 mM NaCl, 60 mM imida- 
zole, 0.1% DDM and 0.01% CHS. Bound protein was eluted with buffer 
consisting of 20 mM HEPES pH 7.5, 200 mM NaCl, 200 mM imidazole, 
0.05% DDM and 0.005% CHS. The eluted protein was incorporated 
into lipid nanodiscs with a 1:400:10 molar ratio of
protein:1-palmitoyl-2-oleoyl-sn-glycero-3-phospho-(1’-rac-glycerol):membrane-scaffold
protein 1D1 (MSP1D1). This mixture was incubated at 4 °C for 2.5 h with
gentle agitation. Reconstitution was initiated by removing detergent 
by incubating with Bio-beads (Bio-Rad) at 4 °C overnight with constant 
rotation. Bio-beads were removed and the nanodisc reconstitution mixture
was bound again to Ni²⁺-NTA resin at 4 °C for 2 h to remove free
nanodiscs. The resin was washed with 10 column volumes of wash buffer 
(20 mM HEPES pH 7.5, 200 mM NaCl and 20 mM imidazole) followed 
by 4 column volumes of elution buffer (20 mM HEPES pH 7.5, 200 mM
NaCl and 200 mM imidazole). The eluted protein was further puri- 
fied by loading on a Superdex 200 Increase 10/300 GL size-exclusion 
column (GE Healthcare Life Sciences) in gel-filtration buffer (20 mM 
HEPES pH 7.0 and 150 mM NaCl). 
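
As a rough illustration of the 1:400:10 molar ratio quoted above, the following sketch converts an amount of protein (in nanomoles) into the matching amounts of lipid and MSP1D1. The function name and the 5 nmol input are assumptions made for the example and are not values reported in this study.

```python
from fractions import Fraction

# Molar ratio protein : lipid : MSP1D1, as stated in the text.
RATIO = {"protein": 1, "lipid": 400, "msp1d1": 10}


def nanodisc_components(protein_nmol):
    """Return the nmol of each component for a given nmol of protein."""
    scale = Fraction(protein_nmol) / RATIO["protein"]
    return {name: float(scale * parts) for name, parts in RATIO.items()}


if __name__ == "__main__":
    # 5 nmol of protein is a hypothetical input chosen for illustration.
    for name, nmol in nanodisc_components(5).items():
        print(f"{name}: {nmol:g} nmol")
```

Keeping the ratio in one dictionary means that a different stoichiometry can be tried by editing a single line.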


Identification of PfCRT-specific Fabs using phage display 

For the Fab screen, PfCRT 7G8 was reconstituted into nanodiscs 
using biotinylated MSP1D1. Before nanodisc assembly, MSP1D1 was 
chemically biotinylated as previously described'®****”, Efficient bioti- 
nylation was confirmed using a pull-down assay with streptavidin- 
coated paramagnetic particles (Promega). Biopanning was performed 
using the Fab Library E**”’ that expresses a subset of Fabs on the phage 
surface. Binding was assayed in selection buffer (25 mM HEPES pH 7.4, 
150 mM NaCl and 1% bovine serum albumin). In the first round, biopanning
was performed manually using 200 nM of PfCRT 7G8-MSP1D1
nanodiscs immobilized onto magnetic beads. Following three washes 
with selection buffer, the resuspended beads enriched for phage- 
expressing PfCRT-specific Fabs were used to infect log-phase E. coli 
XL-1 Blue bacteria. Phages were amplified overnight in 2x YT medium 
supplemented with ampicillin (100 μg ml⁻¹) and M13-KO7 helper
phage (10⁹ PFU ml⁻¹). To increase the stringency of selection, four additional
rounds of biopanning were performed with decreasing target
concentrations of 150 nM, 75 nM, 40 nM and 20 nM. For each round, 
the amplified phage pool from each preceding round was used as the 
input. For rounds 2-5, biopanning was performed semi-automatically 
using a Kingfisher magnetic beads handler (Thermo Fisher Scientific). 
To reduce the presence of non-specific binders, the phage pools for 
rounds 2-5 were precleared with 100 μl of streptavidin particles. For
all rounds we used empty, non-biotinylated MSP1D1 nanodiscs as soluble
competitors in excess, at a constant concentration of 1.5 μM. In
rounds 2-5, bound phage particles were removed by elution from mag- 
netic beads following a 15-min incubation step with 1% Fos-choline-12 
prepared in selection buffer. 


Single-point phage ELISA to validate Fab binding to PfCRT

The initial validation for selection was performed by single-point 
phage ELISA using individual clones from rounds 4 and 5. All ELISA 
experiments were performed in 96-well plates (Nunc) coated with 
2 μg ml⁻¹ neutravidin and blocked with selection buffer. Colonies
of E. coli XL1-Blue bacteria containing phagemids were used to inoculate
400 μl 2× YT medium supplemented with 100 μg ml⁻¹ ampicillin
and 10⁹ PFU ml⁻¹ M13-KO7 helper phages. Phages were amplified overnight
in 96-well deep-well blocks at 37 °C with shaking at 280 rpm.
Amplified phages were diluted tenfold into selection buffer and 
assayed against either PfCRT-loaded or empty biotinylated nanodiscs. 
Target proteins were immobilized at room temperature for 30 min 
followed by incubation with phage dilutions for 30 min. Bound 
phage particles were detected with TMB substrate (Thermo Fisher 
Scientific) following a 30-min incubation with HRP-conjugated anti- 
M13 monoclonal antibody (GE Healthcare). The reaction was quenched 
with 1.0 M HCl and the absorbance was measured at 450 nm. Wells 
containing empty nanodiscs were used to determine non-specific 
binding. 


Fab expression and purification 

Specific binders from phage ELISA were selected based on their signal/ 
background ratio’’ and were sequenced at the University of Chicago 
Comprehensive Cancer Center DNA Sequencing facility. Unique clones 
were sub-cloned in pRH2.2 using the In-Fusion Cloning kit (Takara Bio). 
Sequence-verified Fab expression vectors were then transformed into 
E. coli BL21-Gold competent cells (Agilent), and grown overnight in
2x YT broth supplemented with 100 μg ml⁻¹ ampicillin at 37 °C. On the
next day, cultures were inoculated into fresh YT broth, grown at 37 °C
until optical density at 600 nm reached 0.8, and then induced for 4 h
at 37 °C. Cells were collected by centrifugation and stored at —80 °C 
until use. Cell pellets were resuspended in buffer (20 mM HEPES pH 7.4, 
150 mM NaCl and 0.5 mM MgCl₂) supplemented with 1 mM PMSF and
1 μg ml⁻¹ DNase I (Gold Biosciences). These preparations were sonicated
(Branson Sonifier) until complete lysis was achieved. Lysates were 
incubated at 60 °C for 30 min to eliminate potential Fab proteolysed 
fragments, subsequently cooled on ice and cleared by centrifuga- 
tion. Supernatants were filtered and then loaded onto a 5-ml HiTrap 
MabSelect SuRe column (GE Healthcare) equilibrated with wash buffer 
(20 mM HEPES pH 7.4 and 500 mM NaCl). The column was washed 
10 times with wash buffer and Fabs were eluted with 0.1 M acetic acid. 
Fractions containing Fabs were loaded onto a 1-ml Resource S column
(GE Healthcare) equilibrated with buffer A (50 mM sodium acetate,
pH 5.0). Following washing with 10 column volumes of buffer A,
Fabs were eluted by a linear 0-50% gradient with buffer B (50 mM
sodium acetate, pH 5.0 and 2 M NaCl). Fractions containing Fabs were
evaluated for purity by SDS-PAGE and subsequently pooled and dialysed
overnight in 20 mM HEPES pH 7.4 and 150 mM NaCl.


Assessment of Fab binding affinity for PfCRT 

To estimate apparent binding affinity, we performed multi-point ELISA 
with purified Fabs’’, which were diluted serially threefold starting 
from a 3-μM maximum concentration. ELISA plates were coated and
blocked as described above for the phage ELISA studies. Then, 50 nM
of biotinylated PfCRT 7G8-MSP1D1 nanodiscs was immobilized on the
plates by a 30-min incubation, followed by washing. Fab dilutions were
added to wells containing immobilized PfCRT and allowed to bind for
30 min. Bound Fab was detected with TMB substrate following a 30-min
incubation with HRP-conjugated mouse anti-human IgG F(ab’)₂ monoclonal
antibody (Jackson). Reactions were quenched with 1.0 M HCl
and the absorbance was measured at 450 nm. A₄₅₀ values were plotted
against the log₁₀ of the Fab concentration. EC₅₀ values were then calculated
in GraphPad Prism 8 using a variable slope model and assuming
a sigmoidal dose response. 
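
The dilution series and curve shape described above can be sketched as follows. This is a minimal illustration that assumes the general four-parameter (variable slope) logistic form on a log₁₀ concentration axis. It does not reproduce the GraphPad Prism fit, and the number of dilution points and the curve parameters are hypothetical.

```python
import math


def serial_dilution(top, factor, n_points):
    """Concentrations starting at `top`, each `factor`-fold lower than the last."""
    return [top / factor**i for i in range(n_points)]


def four_param_logistic(log10_conc, bottom, top, log10_ec50, hill):
    """Variable-slope sigmoidal dose response on a log10 concentration axis."""
    return bottom + (top - bottom) / (1.0 + 10 ** ((log10_ec50 - log10_conc) * hill))


if __name__ == "__main__":
    # The 3 uM (3,000 nM) top concentration and threefold steps follow the text;
    # the 8 points and the curve parameters below are illustrative assumptions.
    for conc_nm in serial_dilution(3000.0, 3.0, 8):
        a450 = four_param_logistic(math.log10(conc_nm), 0.05, 2.0, math.log10(50.0), 1.0)
        print(f"{conc_nm:10.2f} nM -> A450 {a450:.3f}")
```

By construction, the response at a concentration equal to the EC₅₀ lies halfway between the bottom and top plateaus, whatever the Hill slope.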


PfCRT 7G8 complex formation with the Fab CTC 

Size-exclusion chromatography (SEC) fractions of purified PfCRT in 
nanodiscs were incubated with the Fab CTC on ice for 2 h in a 1:3 molar
ratio of protein to Fab. The PfCRT-Fab complex was concentrated 
and filtered, and then loaded on a Superdex 200 Increase 10/300 GL 
size-exclusion column in gel-filtration buffer (20 mM HEPES pH 7.0 
and 150 mM NaCl). 


Negative-stain electron microscopy 

To determine the sample quality and success of the Fab binding, purified
protein was diluted to 0.01 mg ml⁻¹ (for PfCRT without Fab) or
0.005 mg ml⁻¹ (for PfCRT with Fab) and applied onto copper grids (Ted
Pella). These grids were overlaid by a thin (approximately 1.5 nm) layer 
of continuous carbon that had been plasma-cleaned (Gatan Solarus) for 
30 s using a mixture of H₂ and O₂. Afterwards, filter paper (Whatman 4)
was used to remove the protein solution. Then, 3 μl of 2% uranyl formate
was added and immediately removed by absorbing with filter paper— 
this was repeated seven times. The grid was imaged on either a Tecnai 
T12 microscope (FEI) (for PfCRT without Fab) or a Tecnai TF20 micro- 
scope (FEI) (for PfCRT with Fab). Both microscopes were equipped 
with a Tietz F416 CCD camera (Tietz) with 1.23 Å or 1.10 Å per pixel,
respectively, using the Leginon software package*®. In total, 166 and 
87 images were collected for PfCRT without and with Fab, respectively, 
and all images were processed using the Appion software package“ to 
obtain two-dimensional classes with Relion 2.1%. The micrographs 
showed good particle dispersion. Two-dimensional class averages 
showed that the Fab addition resulted in a clear fiducial for particle 
alignment (Extended Data Fig. 2). 


Single-particle cryo-EM vitrification and data acquisition 

Purified PfCRT-Fab complex was concentrated to 1.56 mg ml⁻¹ using a
30-kDa concentrator (Amicon). In brief, 2.5 μl of sample was added to
a plasma-cleaned (Gatan Solarus) 1.2/1.3-μm holey gold grid (Quantifoil
UltrAuFoil) and blotted using filter paper on one side for 2 s using
the Leica GP plunger system, before plunging immediately into liquid 
ethane for vitrification. The plunger was operating at 6 °C with >80% 
humidity to minimize evaporation and sample degradation. Images 
were recorded on a Titan Krios electron microscope (FEI) equipped
with a Cs corrector and K2 summit direct detector (Gatan) operating
at 0.5175 Å per pixel (calibrated using an AAV2 dataset) in counting
mode using the Leginon software package”’. An energy filter slit width 
of 20 eV was used during the collection and was aligned automati- 
cally every hour using Leginon. Data collection was performed using 
a dose of around 91.56 e⁻ Å⁻² across 80 frames (75 ms per frame) at a
dose rate of approximately 4.0 e⁻ per pixel per s, using a set defocus
range of -1.2 μm to -1.8 μm. A 100-μm objective aperture was used. In
total, 3,377 micrographs were recorded over a single 2.5-day collection
using an image beam shift data-collection strategy“. Ice thickness was 
monitored after every fourth exposure using the Leginon zero-loss peak 
algorithm* and was determined to have a mean ± s.d. of 23.1 ± 9.1 nm.
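
The relationship between the quoted frame count, frame time, pixel size and dose can be checked with a short calculation. The sketch below is only a consistency check under the simple assumption that total dose equals dose rate multiplied by exposure time and divided by pixel area. The function names are illustrative.

```python
def exposure_time_s(n_frames, frame_time_s):
    """Total exposure time for a movie of n_frames."""
    return n_frames * frame_time_s


def total_dose_per_A2(dose_rate_e_per_px_s, exposure_s, pixel_size_A):
    """Accumulated dose in electrons per square angstrom."""
    return dose_rate_e_per_px_s * exposure_s / pixel_size_A**2


def dose_rate_per_px_s(total_dose_e_per_A2, exposure_s, pixel_size_A):
    """Dose rate in electrons per pixel per second implied by a total dose."""
    return total_dose_e_per_A2 * pixel_size_A**2 / exposure_s


if __name__ == "__main__":
    t = exposure_time_s(80, 0.075)  # 80 frames of 75 ms, about 6 s
    # A rate of exactly 4.0 e-/pixel/s gives roughly 89.6 e-/A^2.
    print(f"exposure: {t:.2f} s")
    print(f"dose at 4.0 e-/px/s: {total_dose_per_A2(4.0, t, 0.5175):.2f} e-/A^2")
    # The quoted 91.56 e-/A^2 implies a rate of about 4.09 e-/pixel/s.
    print(f"rate for 91.56 e-/A^2: {dose_rate_per_px_s(91.56, t, 0.5175):.3f} e-/px/s")
```

The two figures agree to within the rounding implied by "approximately 4.0".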


Data processing 

Video frames were aligned using MotionCor2* with 3 by 3 patches, a 
grouping of 3 and a B-factor of 100 using the Appion software package”.
Micrograph contrast transfer function (CTF) estimations were per- 
formed using both CTFFind4” and GCTF*’, and the best estimates based 
on confidence were selected using the Appion software package. Micrographs
that had an estimated resolution at 0.5 confidence level of worse
than 10 Å were removed, resulting in 3,297 micrographs. DoG picker”
was used to pick 682,520 particles (extracted binned by 2), which were 
transferred into Relion 2.1% for two-dimensional classification. Two- 
dimensional class averages that showed clear structural details of PfCRT 
were used as templates for template-based picking using Gautomatch 
(https://www.mrc-lmb.cam.ac.uk/kzhang/Gautomatch). To avoid
missing any particles, we chose a lenient threshold that resulted in
1,095,304 initial picks. These were pared down to 183,241 particles after 
Relion two-dimensional classification. Two rounds of CryoSPARC v.2° 
ab initio (using two models) were then performed to further refine the 
particle stack to 35,682 particles, which yielded a resolution of 8.27 Å
after three-dimensional homogeneous refinement. Particles were then 
re-centred and re-extracted using Relion and per-particle CTF estima- 
tion using GCTF was then performed. The particle stack was brought 
back into CryoSPARC for one round of ab initio refinement (using two 
models). The best set of particles and model was then used for non-uniform
refinement. This resulted in a 3.7 Å map from 17,034 particles.

The mag_distortion_correct software™ was then used on the micrographs
to correct for magnification anisotropy of 1.3%, which was
determined previously from a gold-replica grating using the
mag_distortion_estimate software™. The resulting particles were then subjected to
three rounds of CryoSPARC non-uniform refinement, re-centering and 
re-extracting in Relion, with removal of overlapped particles centred 
no more than 20 pixels from each other, and CTF refinement using 
cisTEM™. This resulted in a 3.6 Å density map from 17,030 particles.

To optimize the alignment of frames, video frames were re-aligned 
using MotionCor2 with 2 by 2 patches, a grouping of 3, correction for 
in-frame motion, and a B-factor of 150 for global frame alignment and
50 for local frame alignment. Visual inspection of the resulting micro- 
graphs showed that certain micrographs fared better using the new 
frame alignment parameters, while some fared worse than with the 3 
by 3 patches described above. To determine the best frame-alignment 
parameter on a per-particle basis, particles were extracted from micrographs
using both conditions. These particles were then subjected to a
CryoSPARC non-uniform refinement and one round of cisTEM refine- 
ment. Thereafter, we chose the alignment parameter that produced 
the highest score. This resulted in 7,191 particles coming from micro- 
graphs aligned with 2 by 2 patches, and 9,714 particles coming from 
micrographs aligned with 3 by 3 patches. Particle-polishing algorithms 
that require neighbouring particles to aid in alignment were tested but 
gave consistently worse results, probably because of the low average 
number of particles per micrograph (around 5.3) caused by the high 
magnification used and sizeable number of free Fab contaminants. 
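
The per-particle choice between the two frame-alignment conditions can be sketched as a simple comparison of refinement scores. The snippet below is a hypothetical illustration: the particle identifiers, score values and tie-breaking rule (ties keep the 3 by 3 alignment) are assumptions, and the actual scores came from the cisTEM refinement described above.

```python
from collections import Counter


def pick_best_alignment(scores_3x3, scores_2x2):
    """For each particle present in both sets, keep the condition with the higher score.

    Ties are resolved in favour of the 3x3 condition (an arbitrary assumption).
    """
    choice = {}
    for pid in scores_3x3.keys() & scores_2x2.keys():
        choice[pid] = "2x2" if scores_2x2[pid] > scores_3x3[pid] else "3x3"
    return choice


if __name__ == "__main__":
    # Toy scores for three hypothetical particles.
    s3 = {"p1": 0.50, "p2": 0.90, "p3": 0.40}
    s2 = {"p1": 0.70, "p2": 0.80, "p3": 0.40}
    best = pick_best_alignment(s3, s2)
    print(best)
    print(Counter(best.values()))
```

Counting the chosen labels, as in the last line, gives the kind of per-condition particle totals reported in the text.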

Next, signal subtraction using Relion was performed. Subtraction of 
just the nanodisc did not improve the overall resolution. However, sub- 
traction of both the nanodisc and constant domain of the Fab improved 
the resolution of the reconstruction, after CryoSPARC non-uniform 
refinement, to resolutions of 3.3 Å overall and 3.2 Å for PfCRT alone.
This final map was sharpened using phenix.auto_sharpen”, which 
automatically selected b_iso sharpening to high_resolution cutoff as 
the algorithm to use. Overall b_sharpen applied was 28.3 Å², final b_iso
obtained was 47.3 Å² and the high-resolution cut-off was 3.2 Å. All conversions
between Relion, CryoSPARC and cisTEM were performed using
the pyem script (D. Asarnow, unpublished materials,
https://github.com/asarnow/pyem).


Structural model building and refinement 
To build the PfCRT model, we first used the Chimera™ program to 
approximately segment out the PfCRT part of the map from the nano- 
disc and Fab density. De novo model building for the PfCRT primary 
sequence in the cryo-EM map was initiated using Rosetta®. After two 
rounds of Rosetta model building, the partial model was brought into 
Coot” for manual model building. For Fab model building, the Fab 
portion of the deposited yeast nucleoprotein complex (PDB 4XMM)*” 
was used as a starting template, as it had the same scaffold backbone.
An extra electron density between PfCRT helices 1, 9 and 10 was not 
modelled by protein density and was likely of lipid origin. Out of the 
lipids present in the nanodisc, CHS produced the best density fit and 
was therefore incorporated into the model. 

Subsequently, model adjustment and refinement were performed 
iteratively in Coot and Phenix, with the statistics examined using
Molprobity® until no further improvements were observed. The final
map and model were then validated using: (1) EMRinger®’ to compare 
map to model; (2) ResMap® to calculate map local resolution; and 
(3) a program suite with three-dimensional Fourier shell correlations 
(FSCs)*. To calculate the FSC values, we used a cut-off criterion® of 
0.143. The three-dimensional FSC program suite® was used to calculate 
the degree of directional resolution anisotropy. Map-to-model FSCs 
were also calculated by first converting the model to a map using the 
Chimera molmap function at Nyquist resolution (2.07 Å). A mask was
generated from this map using Relion (after low-pass filtering to 8 Å,
extending by 1 pixel and applying a cosine-edge of 3 pixels) and was 
then applied to the density map. Map-to-model FSCs were calculated 
using EMAN®. 
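
Two of the quantities above lend themselves to a short worked example: the Nyquist limit, which is twice the pixel size, and the resolution at which an FSC curve falls below the 0.143 criterion. The sketch below assumes a map pixel size of 1.035 Å (twice the 0.5175 Å physical pixel size, which is consistent with the stated 2.07 Å Nyquist limit but is not given explicitly in the text) and uses an invented toy FSC curve with linear interpolation between shells.

```python
def nyquist_resolution_A(pixel_size_A):
    """Finest resolvable spacing for a given sampling, in angstroms."""
    return 2.0 * pixel_size_A


def resolution_at_cutoff(spatial_freqs, fsc, cutoff=0.143):
    """Resolution (in angstroms) where the FSC first drops below `cutoff`.

    `spatial_freqs` are in 1/angstrom and ascending. The crossing point is
    found by linear interpolation between neighbouring shells. Returns None
    if the curve never falls below the cutoff.
    """
    for i in range(1, len(fsc)):
        if fsc[i - 1] >= cutoff > fsc[i]:
            f0, f1 = spatial_freqs[i - 1], spatial_freqs[i]
            frac = (fsc[i - 1] - cutoff) / (fsc[i - 1] - fsc[i])
            return 1.0 / (f0 + frac * (f1 - f0))
    return None


if __name__ == "__main__":
    print(f"Nyquist: {nyquist_resolution_A(1.035):.2f} A")
    # Toy FSC curve, not data from this study.
    freqs = [0.10, 0.20, 0.30, 0.40]
    curve = [1.00, 0.90, 0.243, 0.043]
    print(f"FSC=0.143 resolution: {resolution_at_cutoff(freqs, curve):.2f} A")
```

Dedicated packages typically report this value directly. The interpolation here only shows how a single resolution number is read off the curve.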


Model analysis

A cavity search using the Solvent Extractor from Voss Volume Voxelator
server was performed using an outer-probe radius of 10 Å and inner-probe
radius of 3 Å. We then used a Dali server® to search for other PDB
structures with a similar fold. This search produced top results for Vrg4 
(PDB 5OGE and 5OGK)*, YddG (PDB 5I20)”, TPT (PDB 5Y78 and 5Y79)°®,
Zea mays CST (PDB 6I1R)° and Mus musculus CST (PDB 6OH4)”, with
Z scores ranging from 26.7 to 16.0. Thereafter, there was a large drop
in Z score for the next best hit at 8.5 (the chloride pumping rhodopsin,
PDB 5B2N) that visually did not superimpose well on PfCRT (the root
mean square deviation of superimposition was 8.1 Å).


Amino acid sequence alignments 

For conservation analysis in apicomplexan parasites, we obtained 11 
orthologues of CRT from OrthoMCL-DB”. Their amino acid sequences, 
in addition to the PfCRT Dd2 and wild-type isoforms, were then aligned 
using MUSCLE”, resulting in 13 orthologues in total. The sequences
used for Extended Data Fig. 7 are P. falciparum
PfCRT strain 7G8 (UNIPROT W7FI162) and its orthologues PfCRT Dd2
(UNIPROT F5CEB4), wild-type PfCRT (the canonical 3D7 wild-type
sequence; UNIPROT Q9N623), P. reichenowi CRT (PrCRT; UNIPROT
A0A2P9D9K2), P. vivax CRT strain Sal-1 (PvCRT; UNIPROT Q9GSD3), P.
knowlesi CRT strain H (PkCRT; UNIPROT Q9GSD7), P. berghei CRT strain
ANKA (PbCRT; UNIPROT Q9GSD8), P. chabaudi CRT strain chabaudi
(PcCRT; UNIPROT Q7ZOV9), Babesia microti CRT strain RI (BmCRT; UNIPROT
A0A1N6LY67), T. annulata CRT strain Ankara (TaCRT; UNIPROT
Q4UDS9), Eimeria tenella CRT (EtCRT; UNIPROT U6L1M8), Toxoplasma
gondii CRT (TgCRT; UNIPROT S8EU26) and Cryptosporidium hominis
CRT strain TU502 (ChCRT; UNIPROT A0A0S4THJ3). Multiple sequence
alignments were displayed using ESPript”. 
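
Once sequences are aligned, per-position conservation can be summarized in many ways. The sketch below shows one simple choice, the fraction of sequences that share the most common residue in each column, applied to invented toy sequences. It is an illustration only and is not the scoring used by MUSCLE or ESPript.

```python
from collections import Counter


def column_conservation(aligned_seqs):
    """Fraction of sequences sharing the most common residue in each column.

    Gap characters ('-') never count as the conserved residue, but they do
    count towards the denominator, so gappy columns score low.
    """
    length = len(aligned_seqs[0])
    if any(len(seq) != length for seq in aligned_seqs):
        raise ValueError("aligned sequences must all have the same length")
    scores = []
    for column in zip(*aligned_seqs):
        counts = Counter(res for res in column if res != "-")
        top = counts.most_common(1)[0][1] if counts else 0
        scores.append(top / len(column))
    return scores


if __name__ == "__main__":
    # Toy alignment of three hypothetical sequences.
    toy = ["MKT-", "MRT-", "MKSA"]
    for pos, score in enumerate(column_conservation(toy), start=1):
        print(f"column {pos}: {score:.2f}")
```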


Computational studies 
All calculations were performed using the Schrödinger molecular
modelling suite (version 2019-1). The PfCRT protein structure solved 
herein was analysed using the Protein Preparation Wizard”. In this 
step, force field atom types and bond orders were assigned, missing 
atoms including hydrogens were added, tautomer/ionization states 
were assigned, Asn, Gln and His residues were flipped to optimize the 
hydrogen bond network, and a constrained energy minimization was 
performed. For mutation modelling, including to generate models of 
Dd2 and WT (3D7), point mutations were introduced into the PfCRT 7G8 
structure using the Residue and Loop Mutation tool. The refinement 
step consisted of a local minimization in implicit solvent with Prime”.
For simulations of molecular dynamics, we used the same proto- 
col to simulate the 7G8 isoform and the different single amino acid 
variants. First, the protein was aligned so that the membrane plane 
corresponded to the xy plane of the coordinate system, using the 
OPM (Orientation of Proteins in Membranes) database”. Then, the 
protein was embedded in a 1-palmitoyl-2-oleoyl-glycero-3-phosphocholine
(POPC) membrane model, included in an orthorhombic box
of simple point-charge (SPC) water solvent with a 10 Å buffer on each
dimension, and neutralized by adding the appropriate number of Cl⁻
ions, using the System Builder panel. The simulations of molecular
dynamics were performed using Desmond”. Each simulation started 
with a system relaxation with a default protocol that was followed by 
a 250-ns production run in an NPgT ensemble, at a temperature of 
300 K and a pressure of 1.01325 bar. The resulting trajectories were 
analysed using the Simulation Event Analysis tool to monitor distances 
of interest and the frames for each trajectory were clustered using the 
Desmond Trajectory tool. 

The Poisson-Boltzmann electrostatic potential surface panel, which 
uses the adaptive Poisson-Boltzmann solver method, was used to 
calculate the electrostatic potential surfaces. The solute and solvent 
dielectric constants were set to 1 and 80, respectively, solvent radius 
was 1.4 Å, temperature was 298 K and grid extension was 5 Å. The open- 
to-cytosol homology model of PfCRT was created using Maestro's 
Multiple Sequence Alignment tool. Leveraging the inverted structural 
repeats of PfCRT that are related by two-fold pseudo-symmetry, the 
sequence of each TM helix was aligned and mapped onto the structure 
of the related TM helix, as previously carried out for other transporters 
with inverted structural repeats. The positions of the loops and 
JM helices in the open-to-cytosol conformation could not be defined 
owing to a lack of homology in these regions. The individually aligned 
TM helices were merged to give the final model, which was further 
refined using restrained minimization from the Protein Preparation 
Wizard panel. 


Production of pfcrt gene-edited parasites 

7G8 parasites were engineered to express pfcrt variants that added the 
F145I or C350R mutations, or the 7G8 reference sequence, into the 7G8 
coding sequence, using a previously published method based on zinc- 
finger nuclease-mediated gene editing. Edited clones were recovered 
by limiting dilution. Parasite lines were authenticated by sequencing 
the pfcrt alleles and whole-genome sequencing of the 7G8 parental 
line (provided by T. Wellems, NIAID/NIH). Testing for mycoplasma 
revealed no contamination. 


[³H]CQ and [³H]PPQ accumulation assays in cultured parasites 

This protocol has been previously described in detail. In brief, sorbitol- 
synchronized parasite cultures of recombinant 7G8, 7G8 + F145I, 
7G8 + C350R, Dd2 and 3D7 (expressing wild-type pfcrt) were collected 
as early trophozoites (24-26 h after invasion) and magnet-purified 
(VarioMACS, Miltenyi Biotec). These lines are referred to herein as 
7G8, 7G8 + F145I, 7G8 + C350R, Dd2 and wild-type (3D7). The pure 
trophozoite-infected erythrocytes were eluted at 37 °C in fresh 1× PBS 
containing 2 mM EDTA and supplemented with 0.5% bovine serum 
albumin. The eluate was then centrifuged (1,500 rpm, 5 min), and the 
pellet was washed and resuspended in bicarbonate-free RPMI 1640 
medium (supplemented with 25 mM HEPES, 10 mM glucose and 0.2 mM 
hypoxanthine and adjusted to pH 7.4) at 37 °C. The exact incubation cell 
numbers were determined by counting an aliquot of the resuspended 
cells using a haemocytometer. Uptake measurements began with the 
addition of 250 µl of trophozoite suspension, at an estimated haematocrit 
of 30,000-120,000 cells per µl, to an equal volume of 10 nM [³H]CQ 
(60 Ci mmol⁻¹; American Radiolabelled Chemicals) or 40 nM [³H]PPQ 
(15 Ci mmol⁻¹; American Radiolabelled Chemicals) in bicarbonate-free 
medium to yield final [³H]CQ and [³H]PPQ concentrations of 5 nM and 
20 nM, respectively. Parallel control measurements were conducted 
with uninfected erythrocytes at the same haematocrit. After 1 h of 
incubation at 37 °C in a water bath, duplicate 200 µl aliquots of the 
suspensions were transferred into 1.5-ml Eppendorf tubes containing 
300 µl of dibutyl phthalate (Sigma Aldrich, 1.04 g ml⁻¹) and centrifuged 
immediately (13,000 rpm; 2 min) to sediment the cells through the oil, 
thereby terminating [³H]-drug uptake. From each duplicate, 100 µl 
of supernatant was transferred into a vial containing 5 ml of scintillation 
fluid (CytoScint, MP Biomedicals) to determine the external drug 
concentration [Drug]external. The pellets were digested by overnight 
incubation at 55 °C with 100 µl tissue solubilizer (NCS-II, GE Healthcare) 
diluted 1:2 in absolute ethanol, then bleached the next day with 30% 
hydrogen peroxide (25 µl) followed by addition of 25 µl glacial acetic 
acid to block luminescence. Digested pellets were transferred into a 
vial containing 10 ml of scintillation cocktail and measured in a scintillation 
counter to determine the internal drug concentration [Drug]internal. 
After adjusting for non-specific uptake by subtracting measurements 
from the uninfected erythrocyte controls, the CAR was calculated as 
the ratio of [Drug]internal/[Drug]external, normalized to 1 × 10⁶ infected 
erythrocytes and assuming a mean volume of a trophozoite-infected 
erythrocyte of 75 femtolitres. 
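
The CAR arithmetic described above can be sketched in a few lines. This is a minimal illustration, not the authors' analysis script: the function names are hypothetical, and it assumes that scintillation counts have already been converted to moles of drug and that the external concentration is given in mol per litre.

```python
FEMTOLITRE_IN_LITRES = 1e-15


def internal_concentration(pellet_mol, n_cells, cell_volume_fl=75.0):
    """Drug concentration (mol/l) inside the pelleted cells.

    Assumes every cell has the same mean volume (75 fl by default,
    the value quoted in the text for a trophozoite-infected erythrocyte).
    """
    total_volume_l = n_cells * cell_volume_fl * FEMTOLITRE_IN_LITRES
    return pellet_mol / total_volume_l


def cellular_accumulation_ratio(infected_pellet_mol, uninfected_pellet_mol,
                                n_cells, external_molar, cell_volume_fl=75.0):
    """CAR = [Drug]internal / [Drug]external after background subtraction.

    The uninfected-erythrocyte measurement is subtracted first to remove
    non-specific uptake, as described in the protocol.
    """
    specific_mol = infected_pellet_mol - uninfected_pellet_mol
    internal = internal_concentration(specific_mol, n_cells, cell_volume_fl)
    return internal / external_molar


# Hypothetical numbers: 1e6 cells, 5 nM external drug.
car = cellular_accumulation_ratio(4.0e-14, 0.25e-14, 1_000_000, 5e-9)
```

Keeping the background subtraction and the volume conversion in separate steps makes it easy to change the assumed cell volume without touching the rest of the calculation.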


Parasite PPQ survival assays 

PPQ survival assays were carried out as previously described with 
slight modifications. In brief, double sorbitol-synchronized ring- 
stage parasites (0-6 h after invasion) were seeded at 1% parasitaemia 
and 1% haematocrit in 96-well flat-bottom plates containing a twofold, 
eight-point PPQ drug dilutions starting at 1,600 nM. Parasites were 
incubated with the drug for 48 h at 37 °C. The drug was then removed 
by washing the plates four times with complete medium using a TECAN 
Freedom EVO 100 liquid handler. The assay plates were further incu- 
bated for 24 h before parasitaemia was measured using an Accuri C6 
flow cytometer by double staining with SYBR Green I and MitoTracker 
Deep Red (to label DNA and the mitochondria, respectively). Percentage 
survival was calculated for each line by dividing the parasitaemia of 
the PPQ-treated parasites by the parasitaemia of the no-drug control. 
Statistical significance was determined by nonparametric, two-tailed 
Mann-Whitney U-tests using GraphPad Prism 8 software. 
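
As a rough illustration of the calculations in this section, the sketch below builds a twofold dilution series, computes percentage survival, and evaluates an exact two-tailed Mann-Whitney U-test by enumerating rank assignments. The function names are hypothetical, the exact test assumes there are no tied values, and it is not a reimplementation of GraphPad Prism.

```python
from itertools import combinations


def twofold_dilution_series(top_nm, n_points):
    """Concentrations of a twofold serial dilution, highest first."""
    return [top_nm / 2 ** i for i in range(n_points)]


def percent_survival(treated_parasitaemia, control_parasitaemia):
    """Parasitaemia of treated wells as a percentage of the no-drug control."""
    if control_parasitaemia <= 0:
        raise ValueError("no-drug control parasitaemia must be positive")
    return 100.0 * treated_parasitaemia / control_parasitaemia


def mann_whitney_exact(x, y):
    """Exact two-tailed Mann-Whitney test (assumes no ties).

    Returns (U for sample x, P value). Every way of assigning ranks to
    sample x is enumerated, so this is only practical for small samples.
    """
    n1, n2 = len(x), len(y)
    u_obs = sum(1 for a in x for b in y if a > b)
    mean_u = n1 * n2 / 2
    observed_dev = abs(u_obs - mean_u)
    extreme = total = 0
    for ranks in combinations(range(1, n1 + n2 + 1), n1):
        u = sum(ranks) - n1 * (n1 + 1) / 2
        total += 1
        if abs(u - mean_u) >= observed_dev:
            extreme += 1
    return u_obs, extreme / total
```

With such small groups the attainable P values are coarse: two completely separated groups of four give 2/70 (about 0.03), and groups of five and three give 2/56 (about 0.04).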


Parasite drug-susceptibility assays 

Parasite susceptibility to antimalarial drugs was measured by incubat- 
ing asynchronous asexual blood-stage parasites with a range of drug con- 
centrations (twofold, ten-point) starting at 0.3% parasitaemia and 1% 
haematocrit in 96-well plates. The assay plates were incubated at 37 °C 
for 72 h in an environment with 5% O₂/5% CO₂/90% N₂. Final parasitae- 
mias were measured using flow cytometry. IC₅₀ values were calculated 
by nonlinear regression analysis. Drug-susceptibility assays were per- 
formed with CQ, monodesethyl-chloroquine (the in vivo metabolite of 
CQ) and PPQ. Statistical significance was determined by nonparamet- 
ric, two-tailed Mann-Whitney U-tests using GraphPad Prism 8 software. 
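
To make the IC₅₀ estimation concrete, here is a minimal least-squares sketch. It is not the regression routine used in the study: it swaps in a plain grid search with successive refinement, assumes growth is normalised so that the curve runs from 100% (no drug) to 0%, and fits only the IC₅₀ and the Hill slope. All names are hypothetical.

```python
import math


def hill(conc, ic50, slope):
    """Percentage growth predicted by a two-parameter Hill curve."""
    return 100.0 / (1.0 + (conc / ic50) ** slope)


def sum_squared_error(concs, responses, ic50, slope):
    return sum((r - hill(c, ic50, slope)) ** 2 for c, r in zip(concs, responses))


def fit_ic50(concs, responses, rounds=6, grid=20):
    """Grid-search least squares over log10(IC50) and the Hill slope.

    Each round evaluates a (grid+1) x (grid+1) lattice, then shrinks the
    search window around the best point.
    """
    lo_log, hi_log = math.log10(min(concs)), math.log10(max(concs))
    lo_slope, hi_slope = 0.2, 8.0
    best_log, best_slope = lo_log, lo_slope
    for _ in range(rounds):
        logs = [lo_log + (hi_log - lo_log) * i / grid for i in range(grid + 1)]
        slopes = [lo_slope + (hi_slope - lo_slope) * j / grid for j in range(grid + 1)]
        _, best_log, best_slope = min(
            (sum_squared_error(concs, responses, 10.0 ** gl, gs), gl, gs)
            for gl in logs for gs in slopes
        )
        span_log = (hi_log - lo_log) / 10.0
        span_slope = (hi_slope - lo_slope) / 10.0
        lo_log, hi_log = best_log - span_log, best_log + span_log
        lo_slope, hi_slope = max(0.05, best_slope - span_slope), best_slope + span_slope
    return 10.0 ** best_log, best_slope


# Synthetic ten-point, twofold series (nM) with a known IC50 of 83 nM.
concs = [1000.0 / 2 ** i for i in range(10)]
responses = [hill(c, 83.0, 2.0) for c in concs]
ic50, slope = fit_ic50(concs, responses)
```

Searching in log concentration keeps the resolution even across a twofold dilution series; a production analysis would normally use a dedicated nonlinear regression routine and report the uncertainty of the fit.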


Binding studies 

All radiolabelled compounds were purchased from American Radi- 
olabelled Chemicals. Unless other concentrations are indicated, bind- 
ing experiments used 125 nM [³H]Arg (40 Ci mmol⁻¹), 370 nM [³H]CQ 
(5.45 Ci mmol⁻¹) or 75 nM [³H]PPQ (15 Ci mmol⁻¹). Scintillation proximity- 
based assay (SPA) binding of 200 ng of PfCRT 7G8, reconstituted 
into nanodiscs, was assayed in buffer composed of 50 mM Tris/MES, 
pH 7.5, 5% glycerol, 0.1 mM TCEP, 0.1% PEI in the presence of 125 µg of 
YSi-His tag SPA beads (Perkin Elmer), and in the presence or absence of 
indicated reagents or Fab. Then, 800 mM imidazole, which competes 
with the His tag (fused to PfCRT) for binding to the copper-coated 
YSi SPA beads, was used to determine the non-proximity background 
signal. Concentrations of radiolabelled compounds were chosen in 
pilot experiments to obtain a signal-to-noise ratio = 5, and specific 
binding was determined by subtracting the counts per minute in the 
presence of imidazole from the counts per minute measured in its 
absence. Control binding experiments of [³H]CQ were carried out using 
nanodiscs containing 100 ng of the unrelated Aquifex aeolicus bacterial 
amino acid transporter LeuT. As the SPA technology prevented reliable 
binding experiments at pH <7.5, binding of [³H]CQ (1 Ci mmol⁻¹) or [³H]PPQ 
(2.5 Ci mmol⁻¹) was performed with equilibrium dialysis using the 
HTD96b dialysis 96-well apparatus and 12,000-14,000 MWCO mem- 
branes in 50 mM Tris/MES, pH 7.5 or pH 5.5, 5% glycerol, 0.1 mM TCEP for 
4 h at 4 °C with 4.2 pmol of the indicated PfCRT variant in nanodiscs. 
Nonlinear regression analysis of the data was performed with Graph- 
Pad Prism 7. Kinetic constants represent the mean ± s.e.m. of the fit. 
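
The background subtraction used for the SPA data, and the mean ± s.e.m. summary used throughout, can be sketched as follows. This is an illustration only, with hypothetical function names and made-up counts.

```python
import math
import statistics


def specific_binding(cpm_without_imidazole, cpm_with_imidazole):
    """Specific SPA signal: counts per minute measured without imidazole
    minus the non-proximity background measured with imidazole."""
    return cpm_without_imidazole - cpm_with_imidazole


def mean_and_sem(values):
    """Mean and standard error of the mean of independent experiments."""
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean, sem


# Hypothetical counts from three independent experiments.
total_cpm = [1100.0, 1200.0, 1300.0]
background_cpm = [200.0, 200.0, 200.0]
specific = [specific_binding(t, b) for t, b in zip(total_cpm, background_cpm)]
mean_cpm, sem_cpm = mean_and_sem(specific)
```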


Transport measurements 

Purified PfCRT variants were reconstituted in preformed liposomes 
made of E. coli total lipids: CHS 94:6 (w/w) at a protein-to-lipid ratio 
of 1:150 (w/w). The lumen of the proteoliposomes was composed of 
100 mM KPi, pH 7.5 and 2 mM β-mercaptoethanol. Uptake of [³H]Arg 
(40 Ci mmol⁻¹), [³H]CQ (5.45 Ci mmol⁻¹) or [³H]PPQ (15 Ci mmol⁻¹) was 
performed by diluting PfCRT-containing proteoliposomes (40 ng 
PfCRT per assay) in 50 µl of 100 mM Tris/MES, pH 5.5 in the presence 
or absence of the indicated compounds. Unless indicated otherwise, 
1 µM valinomycin was added to the reaction to generate a K⁺ diffusion 
potential-driven membrane potential. The reactions were stopped 
after the indicated time periods by the addition of ice-cold 100 mM KPi, 
pH 6.0, 100 mM LiCl and filtered through 0.45-µm nitrocellulose filters 
(Millipore). Filters were dried and incubated in scintillation cocktail, 
and the radioactivity captured on the filters was counted in a Hidex 
SL300 scintillation counter. Specific radioactivity of all compounds 
was confirmed by counting known amounts of each radiolabel by the 
determination of the decay per minute to transform the decay per 
minute of the samples into pmol. Unspecific binding of all compounds 
with the nitrocellulose filters was determined by measuring the mock 
uptake in the absence of liposomes or proteoliposomes, and these 
values (determined for each experiment) were used to calculate the 
uptake activity in liposomes or proteoliposomes. The specific uptake 
was determined by subtracting the time-dependent accumulation 
of the tested compounds in control liposomes (lacking PfCRT) from 
that measured in PfCRT-containing proteoliposomes. Kinetic con- 
stants (Km and Vmax) were determined by measuring the initial rates of 
[³H]Arg, [³H]CQ or [³H]PPQ uptake for periods of 3 s and fitting the data to the 
Michaelis-Menten equation in GraphPad Prism 7; they represent the 
mean ± s.e.m. of the fit. 
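
A minimal sketch of the two calculations described above, subtracting the control-liposome signal and fitting initial rates to the Michaelis-Menten equation v = Vmax·[S]/(Km + [S]). It is not the GraphPad Prism routine: for each trial Km the best Vmax has a closed form (the model is linear in Vmax), and a golden-section search over log10(Km) minimises the remaining squared error. The function names and search bounds are assumptions.

```python
import math


def specific_uptake(proteoliposome_uptake, control_liposome_uptake):
    """Uptake attributable to PfCRT: proteoliposome signal minus the
    signal in control liposomes lacking the protein."""
    return proteoliposome_uptake - control_liposome_uptake


def fit_michaelis_menten(substrate, rates, km_bounds=(1e-3, 1e3), iterations=100):
    """Least-squares estimate of (Km, Vmax) from initial rates."""

    def vmax_for(km):
        # For fixed Km the model is vmax * x with x = S / (Km + S),
        # so the least-squares vmax is a simple ratio of sums.
        x = [s / (km + s) for s in substrate]
        return sum(v * xi for v, xi in zip(rates, x)) / sum(xi * xi for xi in x)

    def sse(log_km):
        km = 10.0 ** log_km
        vmax = vmax_for(km)
        return sum((v - vmax * s / (km + s)) ** 2 for s, v in zip(substrate, rates))

    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = math.log10(km_bounds[0]), math.log10(km_bounds[1])
    for _ in range(iterations):
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
        if sse(c) < sse(d):
            b = d
        else:
            a = c
    km = 10.0 ** ((a + b) / 2.0)
    return km, vmax_for(km)


# Synthetic, noise-free rates generated with Km = 0.77 and Vmax = 92.2.
substrate = [0.1, 0.25, 0.5, 1.0, 2.0, 5.0]
rates = [92.2 * s / (0.77 + s) for s in substrate]
km, vmax = fit_michaelis_menten(substrate, rates)
```

The golden-section search assumes the error has a single minimum in Km, which holds for well-behaved saturation data but should be checked on noisy measurements.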


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All raw video frames, micrographs, the particle stack and relevant 
metadata files have been deposited into EMPIAR, with accession code 
EMPIAR-10330. The electron density map has been deposited into 
EMDB, with accession code EMD-20806. The model has been deposited 
in the PDB, with accession code 6UKJ. All data are available in the paper 
or Supplementary Information. 
Extended Data Fig. 1 | Major PfCRT haplotypes and location of residues 
involved in CQ or PPQ drug-resistance phenotypes. a, The PfCRT canonical 
CQ-S, PPQ-S 3D7 haplotype, the CQ-R 7G8 (South America and Western Pacific) 
and Dd2 (Southeast Asia) haplotypes, and the 7G8 + C350R and Dd2 + F145I 
variants that have emerged in P. falciparum parasites in malaria-endemic areas 
and are associated with PPQ resistance. b, Localization of the mutated residues 
in PfCRT, based on the solved PfCRT structure. Most mutated residues localize 
within or near the boundary of one of the 10 TM helices. Extended Data Table 2a 
provides additional information about the mutated residues in PfCRT 
observed in the field or obtained in drug-pressured cultured parasites. 
Extended Data Fig. 2 | Identification of PfCRT-specific Fabs and preparation 
of purified PfCRT with or without Fabs in nanodiscs. a, Complementarity- 
determining region (CDR) sequences of unique Fabs bioassayed for binding to 
recombinant PfCRT 7G8 incorporated into MSP1D1 nanodiscs. Fabs were 
selected following multiple rounds of enrichment from the phage display 
library E. Ser-Tyr-Gly-Trp (SYGW) residues, which play a dominant part in 
antigen recognition, are highlighted by colour code (red, yellow, green and 
blue, respectively) and are numbered according to the Kabat system. CTC was 
selected to form a stable PfCRT 7G8-Fab complex. b, Single-point ELISA 
quantification of the binding of phage-displayed Fab to PfCRT-incorporated 
biotinylated nanodiscs, empty nanodiscs or buffer (empty wells), measured at 
450 nm absorbance (n = 1). c, EC₅₀ evaluation of purified Fab binding to PfCRT 
incorporated into biotinylated nanodiscs, showing high-affinity binding for 
the Fab CTC (3.7 nM). Data are mean ± s.d. of n = 3 independent experiments. d, 
High-performance liquid chromatography profile of PfCRT 7G8 with or 
without bound Fab CTC. e, SDS-PAGE gel of pooled and concentrated SEC 
fractions from PfCRT 7G8 with or without Fab CTC. The contaminant glutamate 
dehydrogenase (GDH; present as a left shoulder in d) was excluded from single- 
particle analyses. MSP1D1 is a membrane-scaffold protein used to assemble the 
nanodiscs. The identity of PfCRT and GDH was confirmed using mass 
spectrometry. f, Representative negative-stain two-dimensional class averages 
from Relion two-dimensional classification of nanodisc-incorporated PfCRT 
without Fab. g, Representative negative-stain two-dimensional class averages 
from Relion two-dimensional classification of nanodisc-incorporated PfCRT 
with bound Fab CTC. 
Extended Data Fig. 3 | Cryo-EM analysis of the PfCRT 7G8-Fab CTC complex. 
a, Representative micrograph (0.5175 Å per pixel, 1.65 µm defocus). Example 
picked particles contributing to the final reconstruction are circled in green. b, 
Representative two-dimensional class averages from Relion two-dimensional 
classification. c, Flowchart of cryo-EM data processing and refinement of the 
PfCRT 7G8-Fab CTC complex. See Methods for details. d, FSC curves for PfCRT 
7G8 complexed with the Fab CTC variable domain as well as for PfCRT alone. 
Data show the half-map (blue) and map-to-model (purple) resolutions (at 0.143 
and 0.5 cut-offs, respectively), with embedded histograms of directional 
resolutions sampled evenly over the three-dimensional FSC (yellow). The 
corresponding sphericity values are indicated. e, Euler angle distribution plot 
of the final three-dimensional reconstruction from CryoSPARC v.2. f, Local 
resolution display of unsharpened reconstructions of PfCRT complexed with 
the variable domain of Fab CTC, in orthogonal views, sliced through the density 
for the first view. 
Extended Data Fig. 4 | Fit of cryo-EM density with model. Cryo-EM densities (mesh) are superimposed on TM and JM helices of the PfCRT model. The model is 
rendered as a cartoon, coloured in rainbow. 
Extended Data Fig. 5 | PfCRT symmetrical arrangement, topology of TM 
helices and structural comparison with other DMT superfamily members. a, 
View of PfCRT from opposite directions of the digestive vacuole and cytosol, 
with labelling of the four helices closest to the centre (thus contributing to the 
cavity). b, Model of the TM helices in the PfCRT 7G8 structure in the open-to- 
cytosol conformation. c, Structural conservation of PfCRT compared with the 
DMT family members Vrg4 (a GDP-mannose transporter; PDB 5OGE and 
5OGK), YddG (an amino acid transporter; PDB 5I20), TPT (a triose- 
phosphate/phosphate translocator; PDB 5Y78 and 5Y79), and Z. mays CST 
and M. musculus CST (two CMP-sialic acid transporters; PDB 6I1R and 6OH4, 
respectively). Conservation profiles were generated using the Dali server. 
d, The electrostatic potential surface representation of slices reveals the 
cavities of these six proteins in their solved structural states. Note the highly 
negatively charged (red) residues in the PfCRT cavity, contrasting with the 
positively charged (blue) or neutral residues in the other DMT transporters 
(shown as slices to emphasize the individual cavities). Ligands for the non- 
PfCRT transporters are shown as a ball and stick representation. 
Extended Data Fig. 6 | The structure of the PfCRT 7G8-Fab CTC complex 
shows the sequence conservation, localization of CHS, chemical structure 
of CQ and PPQ, and conservation of CQ-R and PPQ-R mutations. a, PfCRT and 
the bound Fab are rendered in cartoon. PfCRT is coloured according to 
sequence conservation (see Extended Data Fig. 7) and the Fab is coloured in 
pink. Inset shows a magnified view of the interaction between PfCRT 7G8 and 
the Fab CTC. b, PfCRT cleft formed by JM1, TM1, TM9 and TM10 demonstrates 
CHS placement. c, PfCRT is rendered as a surface coloured by hydrophobicity, 
from orange (hydrophobic) to blue (hydrophilic). d, Two-dimensional diagram 
of PfCRT-CHS interactions observed in the structure, generated using LigPlot+ 
v.2.1. e, Chemical structures of CQ and PPQ, with the 4-aminoquinoline rings 
shaded. f, Location of CQ (left) and PPQ (right) resistance-associated 
mutations (with side chains rendered as sticks and spheres), modelled onto the 
PfCRT 7G8 structure. Models are colour-coded according to sequence 
conservation derived from 11 apicomplexan species (Extended Data Fig. 7). TM 
helices associated with CQ or PPQ resistance are labelled and the areas 
highlighted with coloured circles. 
Extended Data Fig. 7 | Sequence alignment and secondary structure of 
PfCRT. Sequences of different PfCRT isoforms and other CRT homologues 
were aligned using MUSCLE and displayed using ESPript. Sequences of 
ten orthologues of CRT in other Plasmodium or other apicomplexan parasites 
were obtained from OrthoMCL-DB. The sequences used are PfCRT 7G8 
(UNIPROT W7FI62) and its orthologues PfCRT Dd2 (UNIPROT F5CEB4) and 
wild-type PfCRT (the canonical 3D7 wild-type sequence; UNIPROT Q9N623), P. 
reichenowi CRT (PrCRT; UNIPROT A0A2P9D9K2), P. vivax CRT strain Sal-1 
(PvCRT; UNIPROT Q9GSD3), P. knowlesi CRT strain H (PkCRT; UNIPROT 
Q9GSD7), P. berghei CRT strain ANKA (PbCRT; UNIPROT Q9GSD8), P. chabaudi 
CRT strain chabaudi (PcCRT; UNIPROT Q7ZOV9), B. microti CRT strain RI 
(BmCRT; UNIPROT A0A1N6LY67), T. annulata CRT strain Ankara (TaCRT; 
UNIPROT Q4UDS9), E. tenella CRT (EtCRT; UNIPROT U6L1M8), T. gondii CRT 
(TgCRT; UNIPROT S8EU26) and C. hominis CRT strain TU502 (ChCRT; UNIPROT 
A0A0S4THJ3). The secondary structure of PfCRT is shown as a cartoon above 
the alignments with residue numbering corresponding to the PfCRT 7G8 
reference, along with the positioning of the variant residues indicated in 
Extended Data Fig. 1 (sharing the same colour scheme). The highly conserved 
cysteine residues that probably form disulphide bonds are C289-C312 and 
C301-C309. The degree of conservation was calculated by considering all 
sequences including P. falciparum 3D7 (wild type) but excluding the 7G8 and 
Dd2 variants. Residues conserved (that is, identical or similar) in at least 10 out 
of 13 species are indicated in red text. Residues conserved in all species are in 
white text with red highlighting. None of the mutations associated with CQ or 
PPQ resistance mapped to residues that are fully conserved across these 
apicomplexan species. 
Extended Data Fig. 8 | Binding and transport assays for PfCRT 7G8. a, Total 
binding of 125 nM [³H]Arg to 100 ng of nanodisc-incorporated PfCRT 7G8 was 
measured in the absence (-) or presence (+) of 800 mM imidazole (which 
competes with the His-tagged PfCRT 7G8 isoform for binding to the copper- 
coated YSi SPA beads). Data are mean ± s.e.m. (n = 3 independent experiments); 
grey symbols show the data mean of technical replicates (n = 3) of each 
independent experiment. b, Isotopic dilution of 125 nM [³H]Arg with non- 
radiolabelled Arg revealed a log(EC₅₀) value of -3.40 ± 0.033 (corresponding to 
397 µM). We note that the primary source of Arg in parasitized red blood cells is 
from the proteolysis of haemoglobin, which in its native state as a tetramer is 
present at approximately 5 mM. Data are mean ± s.e.m. (n = 3 independent 
experiments of technical triplicates). c, Binding of 125 nM [³H]Arg in the 
presence or absence of 10 µM verapamil (VP), 1 µM CQ or PPQ, 0.1 µM 
amodiaquine (ADQ), or 1 mM Arg, Lys or Leu. Data were normalized to the signal 
in the absence of the respective non-radiolabelled compound and are 
mean ± s.e.m. (n = 3 independent experiments); grey symbols show the data 
mean of technical triplicates of each independent experiment. d, e, Uptake of 
370 nM [³H]CQ (d) or 250 nM [³H]Arg (e), was measured for 1-min periods in 
PfCRT 7G8-containing proteoliposomes preloaded with 100 mM KPi, pH 7.5 
diluted in buffer composed of 50 mM Tris/MES, pH 5.5 or pH 7.5 with or without 
the K⁺ ionophore valinomycin (Vlm; 5 µM). ctl, control. The valinomycin- 
mediated K⁺ efflux proceeding down its concentration gradient generated an 
inside-negative membrane potential. Empty liposomes lacking PfCRT 7G8 
served as controls. Data are mean ± s.e.m. (n = 3 independent experiments); 
grey symbols show the data mean of technical triplicates of each independent 
experiment. 
Extended Data Fig. 9 | Role of conserved cysteine residues in CQ binding and 
transport, and parasite PPQ dose-response data. a, Cysteine residues that 
probably form disulphide bonds on the loop adjacent to JM2 and connecting to 
TM8 are shown in yellow and rendered as sticks. Their locations are shown as an 
inset of the overall structure and are coloured by conservation (as per 
Extended Data Fig. 7). b, Effect of reducing conditions on the binding of [³H]CQ 
by PfCRT 7G8, 7G8 + C289A and 7G8 + C301A. SPA binding of 100 nM [³H]CQ was 
performed with 200 ng of each protein variant reconstituted into nanodiscs at 
pH 7.5 in the presence of increasing concentrations of β-mercaptoethanol. 
Data (shown as mean ± s.e.m. of n = 3 independent experiments with technical 
triplicates) were normalized with regard to the activity measured for PfCRT 
7G8 in the absence of the reducing agent. c, Effect of the reducing agent 
β-mercaptoethanol on [³H]CQ transport by PfCRT 7G8, 7G8 + C289A or 
7G8 + C301A. Uptake of 100 nM [³H]CQ was measured for 1-min periods in 
control liposomes and proteoliposomes containing the indicated PfCRT 
variants, in the presence of increasing concentrations of β-mercaptoethanol. 
Data are means of n = 2 independent experiments performed as n = 4 technical 
replicates and were subjected to nonlinear regression fitting using hyperbolic 
decay models with either two (for 7G8 and 7G8 + C289A) or three (including 
7G8 + C301A) parameters. Kinetic constants are shown as mean ± s.e.m. of the 
fits. The IC₅₀ (concentration of β-mercaptoethanol yielding half-maximal 
reduction of transport) for 7G8 and 7G8 + C289A were 1.24 ± 0.14 mM and 
2.58 ± 0.39 mM, respectively. For 7G8 + C301A the IC₅₀ was 4.17 ± 0.16 mM with a 
remaining activity of 60.2 ± 8.2% (compared to the activity in the absence of the 
reducing agent). d, PPQ dose-response data for pfcrt-edited and control 
parasite lines, generated from 72-h growth inhibition assays. Data are 
mean ± s.e.m. from n = 4 independent assays performed in duplicate (n = 2). 
Statistical significance was determined using two-tailed Mann-Whitney U- 
tests. 
Extended Data Fig. 10 | Electrostatic potential surfaces of isoform-specific 
PfCRT cavities and simulations of molecular dynamics. a, Electrostatic 
surfaces for the solved open-to-digestive-vacuole conformation for 7G8 and 
modelled isoforms, predicted at pH 5. Images are presented as a vertical slice 
through the transporter, showing net charges in the cavity and locations of TM 
helices. The row below shows the electrostatic surfaces for the homology 
models of these PfCRT isoforms, illustrated in their open-to-cytosol 
configuration, predicted at pH 7.0. b, Surface representation of the 
electrostatic potential of the central cavity of 7G8 and the 7G8 + C350R variant, 
shown in two different orientations with red and blue indicating negatively and 
positively charged residues, respectively. c, Simulations of molecular 
dynamics on the 7G8 structure (without the Fab) over 250-ns trajectories, 
establishing the equilibrium positions of protein side chains and distances 
between position 145 and residues on proximal helices. d, Simulations of 
molecular dynamics on the 7G8 structure (without the Fab) over 250-ns 
trajectories, establishing the equilibrium positions of protein side chains 
between position 350 and residues on proximal helices, and showing marginal 
movement between the C350 and the 350R isoforms. 
Extended Data Table 1 | Cryo-EM data collection, refinement and validation statistics 

                                   PfCRT with Fab-Variable Domains 
                                   (EMD-20806) 
                                   (PDB 6UKJ) 

Data collection and processing 
  Magnification                    215,000 
  Voltage (kV)                     300 
  Electron exposure (e⁻/Å²)        91.56 
  Exposure time (s)                6 
  Dose rate (e⁻/pixel/s)           4 
  Defocus range (µm)               1.5 ± 0.23 
  Pixel size (Å)                   0.5175 
  Symmetry imposed                 C1 
  Number of micrographs            3,377 
  Initial particle images (no.)    183,241 
  Final particle images (no.)      17,030 
  Map resolution (Å)               3.3 
    FSC threshold                  0.143 
  Map resolution range (Å) 
    Local                          2.6-5.6 
    Directional                    3.1-3.3 
  Sphericity of 3DFSC              0.99 

Refinement                         PfCRT                 Fab-Variable Domains 
  Initial model used (PDB code)    -                     4XMM 
  Model resolution (Å)             3.5 
    FSC threshold                  0.5 
  Model resolution range (Å)       2.6-5.6 
  Map sharpening B factor (Å²)     -28.3 
  Residue range                    47-113 and 123-405    Heavy chain: 4-129; light chain: 2-106 
  Model composition 
    Non-hydrogen atoms             2810                  1780 
    Protein residues               350                   231 
    Ligands                        1                     0 
  B factors (Å²) 
    Protein                        81.1                  74.5 
    Ligand                         82.5                  - 
  R.m.s. deviations 
    Bond lengths (Å)               0.0074                0.0076 
    Bond angles (°)                1.16                  1.19 
  Validation 
    MolProbity score               1.55                  1.81 
    Clashscore                     5.91                  6.63 
    Poor rotamers (%)              0.96                  0.52 
    EM-Ringer score                3.49                  3.33 
  Ramachandran plot 
    Favored (%)                    96.5                  93.0 
    Allowed (%)                    3.5                   6.1 
    Disallowed (%)                 0.0                   0.9 



Extended Data Table 2 | Characteristics of PfCRT mutations and isoforms 


a 
Mutations | Origin | Associated haplotype | Associated phenotype | Confirmed | Location | Lining the cavity | Conservation in Plasmodium | Conservation in Apicomplexa | Citation 
C72S Field 7G8 CQ-R (PPQ-S) Yes TM1 Yes 83% 45% 3 
M74I Field Dd2 CQ-R (PPQ-S) Yes TM1 Yes 100% 55% 3 
CQ-R (PPQ-S) 100% 82% 3 
Dd2 and 7G8 CQ-R (PPQ-S) 100% 55% 3 
Dd2+T93S PPQ-R (CQ-R) 100% 64% 7,8 
Dd2+H97Y PPQ-R (CQ-R) 100% 55% 6 
Dd2+C101F PPQ-R (CQ-S) 100% 55% 12 
Dd2+F145I PPQ-R (CQ-S) 67% 36% 6,27 
I218F Dd2+I218F PPQ-R (CQ-R low) TM6 100% 55% 7,8 
A220S Dd2/7G8 CQ-R (PPQ-S) TM6 67% 55% 3 
Q271E Dd2 CQ-R (PPQ-S) TM7 - JM2 67% 36% 3 
N326S/D Dd2/7G8 CQ-R (PPQ-S) TM8 100% 82% 3 
M343L Dd2+M343L PPQ-R low (CQ-S) TM9 100% 55% 6 
C350R Field 7G8+C350R PPQ-R low (CQ-S) Yes TM9 Yes 100% 55% 9; this study 
G353V Field Dd2+G353V PPQ-R mod. (CQ-S) Yes TM9 Yes 100% 55% 6 
I356T/L Field Dd2/7G8 CQ-R (PPQ-S) Yes TM9 Yes 100% 55% 3 
R371I Field Dd2 CQ-R (PPQ-S) Yes TM9 - 10 No 67% 36% 3 
b CQ 
            Binding at pH 7.5                      Binding at pH 5.5                      Transport in proteoliposomes 
Isoform     Kd (µM)      Bmax (mol CQ/mol PfCRT)   Kd (µM)      Bmax (mol CQ/mol PfCRT)   1-min uptake (nmol × mg⁻¹)   Km (µM)      Vmax (nmol × mg⁻¹ × min⁻¹) 
Control     -            -                         -            -                         0.12 ± 0.02                  n.d.*        n.d. 
7G8         0.28 ± 0.06  0.38 ± 0.02               0.21 ± 0.06  0.40 ± 0.03               3.14 ± 0.50                  0.77 ± 0.15  92.20 ± 4.91 
7G8+F145I   0.29 ± 0.05  0.43 ± 0.02               0.24 ± 0.06  0.39 ± 0.03               0.24 ± 0.05                  0.36 ± 0.08  63.01 ± 3.57 
7G8+C350R   0.30 ± 0.06  0.42 ± 0.03               0.13 ± 0.03  0.37 ± 0.02               0.80 ± 0.21                  1.43 ± 0.16  17.97 ± 0.87 
Dd2         0.31 ± 0.04  0.38 ± 0.01               0.20 ± 0.04  0.42 ± 0.04               3.33 ± 0.22                  0.64 ± 0.07  91.37 ± 2.79 
WT          0.28 ± 0.04  0.42 ± 0.02               0.28 ± 0.03  0.40 ± 0.02               0.58 ± 0.08                  n.d.         n.d. 

PPQ 
            Binding at pH 7.5                      Binding at pH 5.5                      Transport in proteoliposomes 
Isoform     Kd (µM)      Bmax (mol CQ/mol PfCRT)   Kd (µM)      Bmax (mol CQ/mol PfCRT)   1-min uptake (nmol × mg⁻¹)   Km (µM)      Vmax (nmol × mg⁻¹ × min⁻¹) 
Control     -            -                         -            -                         0.09 ± 0.01                  n.d.         n.d. 
7G8         0.22 ± 0.04  0.40 ± 0.02               0.09 ± 0.02  0.40 ± 0.01               3.26 ± 0.33                  n.d.         n.d. 
7G8+F145I   0.26 ± 0.03  0.42 ± 0.01               0.22 ± 0.03  0.38 ± 0.01               1.41 ± 0.12                  0.61 ± 0.15  109.30 ± 7.27 
7G8+C350R   0.25 ± 0.04  0.41 ± 0.02               0.21 ± 0.03  0.39 ± 0.02               0.82 ± 0.07                  0.53 ± 0.04  153.00 ± 3.10 
Dd2         0.22 ± 0.03  0.39 ± 0.02               0.08 ± 0.01  0.39 ± 0.01               3.09 ± 0.41                  n.d.         n.d. 
WT          0.26 ± 0.04  0.40 ± 0.02               0.26 ± 0.04  0.38 ± 0.02               0.51 ± 0.04                  n.d.         n.d. 


a, All mutations were identified in field isolates except for C101F, which was selected under PPQ pressure in Dd2 parasites. The PfCRT 7G8 and Dd2 haplotypes are listed in Extended Data Fig. 1. 
The drug first listed in the 'Associated phenotype' column is thought to have been the driver of selection and the susceptibility status is indicated; the drug listed in parentheses shows the 
associated phenotype. b, Dissociation constants (Kd) and maximum binding constants (Bmax) were determined with equilibrium dialysis using 4.2 pmol of the indicated PfCRT variant in 50 mM 
Tris/MES, pH 7.5 or pH 5.5, 5% glycerol and 0.1 mM TCEP. Data (mean ± s.e.m. of n = 3 independent experiments) were subjected to global nonlinear regression fitting in GraphPad Prism and are 
shown as the log-normalized mean ± s.e.m. of the fit. Transport kinetics for [³H]CQ and [³H]PPQ were determined with PfCRT variants reconstituted into proteoliposomes or with liposomes 
(control) using an inwardly directed pH gradient. Data (mean of n = 2 independent experiments of n = 4 technical replicates) for the 1-min uptake of [³H]CQ or [³H]PPQ (depicted in Fig. 3i) are shown 
for each variant tested. Data of the concentration-dependent uptake of [³H]CQ or [³H]PPQ (mean of n = 2 independent experiments of n = 4 technical replicates) shown in Fig. 3j were fitted to 
the Michaelis-Menten equation in GraphPad Prism to obtain the Km and Vmax (shown as mean ± s.e.m. of the fit). 

‘These mutations were detected in P. falciparum parasites from areas of PPQ resistance, except for the lab-selected C101F mutation. Their contribution to PPQ resistance has been confirmed in 
cultured isolates or gene-edited parasite lines. 

Percentage identity of the amino acid present in P. falciparum wild-type (3D7) parasites, when examined across CRT orthologues from the six Plasmodium species listed in Extended Data Fig. 7. 
For P. falciparum, we only considered the 3D7 strain and not the CQ-resistant Dd2 and 7G8 strains. 

Percentage identity of the amino acid present in P. falciparum wild-type (3D7) parasites, when examined across CRT orthologues from the six Plasmodium species plus the five other repre- 
sentative apicomplexan species listed in Extended Data Fig. 7. For P. falciparum, we only considered the 3D7 strain and not the CQ-resistant Dd2 and 7G8 strains. 
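
The conservation percentages in the table can be reproduced with a simple column count. The sketch below is illustrative only: the function name and the example residues are hypothetical, and it assumes the residues at one aligned position have already been extracted, with the P. falciparum 3D7 residue included among the species counted.

```python
def percent_identity(reference_residue, aligned_residues):
    """Percentage of species whose residue at this aligned position is
    identical to the reference (3D7) residue, rounded to a whole number."""
    if not aligned_residues:
        raise ValueError("at least one aligned residue is required")
    matches = sum(1 for residue in aligned_residues if residue == reference_residue)
    return round(100.0 * matches / len(aligned_residues))


# Hypothetical column: six Plasmodium species, five sharing the 3D7 residue.
plasmodium_column = ["C", "C", "C", "C", "C", "S"]
plasmodium_identity = percent_identity("C", plasmodium_column)

# Hypothetical column extended to eleven apicomplexan species.
apicomplexa_column = plasmodium_column + ["A", "A", "G", "T", "V"]
apicomplexa_identity = percent_identity("C", apicomplexa_column)
```

Because only six or eleven species are counted, the table can contain only a small set of values (for example 4/6 rounds to 67% and 6/11 to 55%).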

*n.d., not determined because of low transport activities. 
Extended Data Table 3 | Phenotypic parasite assays with pfcrt-modified P. falciparum parasite lines 


                 7G8             7G8 + F145I     7G8 + C350R     Dd2             WT (3D7) 
[³H]CQ CAR       14.5 ± 3.7      388.4 ± 38.9    406.4 ± 22.4    26.3 ± 2.9      876.7 ± 38.7 
n                4               4               =               6               4 
P vs 7G8         -               0.03            0.03            -               - 
P vs Dd2         -               -               -               -               0.01 
[³H]PPQ CAR      1943.9 ± 82.2   1125.6 ± 59.6   1207.4 ± 119.8  1861.3 ± 91.8   2123.8 ± 72.2 
n                4               4               4               4               4 
P vs 7G8         -               0.03            0.03            -               - 
P vs Dd2         -               -               -               -               0.03 
CQ IC₅₀          83.0 ± 7.5      7.3 ± 0.4       6.9 ± 0.3       142.9 ± 21.8    12.2 ± 0.5 
N                4               4               4               4               4 
P vs 7G8         -               0.03            0.03            -               - 
P vs Dd2         -               -               -               -               0.03 
md-CQ IC₅₀       919 ± 125       47.8 ± 2.5      36.5 ± 2.0      1796 ± 199      48.6 ± 3.1 
N                4               4               4               4               4 
P vs 7G8         -               0.03            0.03            -               - 
P vs Dd2         -               -               -               -               0.03 
PPQ IC₅₀         21.4 ± 1.0      28.7 ± 1.0      29.5 ± 1.6      18.3 ± 1.9      16.2 ± 1.3 
N                4               4               4               4               4 
P vs 7G8         -               0.03            0.03            -               - 
P vs Dd2         -               -               -               -               0.20 
PSA 1600 nM      0.54 ± 0.20     14.7 ± 4.2      5.9 ± 0.9       0.87 ± 0.10     0.51 ± 0.17 
N                5               3               3               4               2 
P vs 7G8         -               0.04            0.04            -               - 
PSA 800 nM       0.56 ± 0.17     13.1 ± 3.9      6.6 ± 0.46      0.89 ± 0.16     0.62 ± 0.16 
N                5               3               3               4               2 
P vs 7G8         -               0.04            0.04            -               - 
PSA 400 nM       0.66 ± 0.20     12.7 ± 2.5      10.5 ± 1.1      1.1 ± 0.24      0.61 ± 0.19 
N                5               3               3               4               2 
P vs 7G8         -               0.04            0.04            -               - 
PSA 200 nM       1.4 ± 0.2       8.6 ± 2.4       9.5 ± 1.2       1.1 ± 0.10      1.2 ± 0.54 
N                5               3               3               4               2 
P vs 7G8         -               0.04            0.04            -               - 
PSA 100 nM       1.6 ± 0.3       4.0 ± 0.6       17425           1.7 ± 0.2       1.3 ± 0.35 
N                5               3               3               4               2 
P vs 7G8         -               0.04            0.04            -               - 
PSA 50 nM        4.9 ± 1.3       8.5 ± 1.2       31.8 ± 9.3      7.6 ± 3.3       4.9 ± 1.1 
N                5               3               3               4               2 
P vs 7G8         -               0.04            0.04            -               - 
PSA 25 nM        48.3 ± 5.5      44.5 ± 2.4      62.1 ± 9.6      35.3 ± 12.2     11.0 ± 1.7 
N                5               3               3               4               2 
P vs 7G8         -               0.79            0.25            -               - 
PSA 12.5 nM      76.7 ± 5.1      62.3 ± 4.1      79.8 ± 5.5      67.7 ± 11.9     F2+£15 
N                5               3               3               4               2 
P vs 7G8         -               0.07            0.79            -               - 
PSA 6.25 nM      87.1 ± 3.5      $3.44 4.5       90.7 ± 5.1      $3.6+6.1        12.1 ± 6.8 
N                5               3               3               4               2 
P vs 7G8         -               0.07            0.79            -               - 
PSA 3.125 nM     93.8 ± 3.5      95.7 ± 3.1      96.6 ± 2.4      95.9 ± 2.1      81.2 ± 19.6 
N                5               3               3               4               2 
P vs 7G8         -               0.07            0.79            -               - 


Cellular accumulation ratios (CARs) of [³H]CQ and [³H]PPQ in parasitised red blood cells indicate the mean ± s.e.m., determined from 4–6 independent assays performed in duplicate. IC50
values (nM) indicate the mean ± s.e.m., as determined by 4 independent assays performed in duplicate. PPQ survival assay (PSA) percentages are presented as the mean ± s.e.m. determined
from 2–5 independent assays performed in duplicate. Statistical significance was determined using two-tailed Mann-Whitney U-tests. P values are reported for comparisons with the isogenic
parasite (control) lines 7G8^7G8 and Dd2^Dd2 (CAR), the isogenic parasite lines 7G8^7G8 or Dd2^Dd2 (IC50) or the isogenic parasite line 7G8^7G8 (PSA). Cells are highlighted in grey if
P < 0.05. md-CQ, monodesethyl-chloroquine (the active CQ metabolite); n, number of assays.
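
Many P values in this table are 0.03 or 0.04. With so few assays per line, an exact two-tailed Mann-Whitney U-test cannot return anything smaller: when the two groups are completely separated, the P value is 2 divided by the number of ways the pooled ranks can be split between the groups. The Python sketch below illustrates this by full enumeration. It assumes an exact test (the legend does not say whether an exact or approximate P value was computed), the function name is ours, and the input numbers are made-up placeholders, not values from the table.

```python
from itertools import combinations


def mann_whitney_exact(x, y):
    """Two-tailed exact Mann-Whitney P value by enumerating all rank splits.

    Only practical for small samples. Ties receive mid-ranks.
    """
    pooled = sorted(list(x) + list(y))
    # Assign mid-ranks to tied values.
    rank_of = {}
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j] == pooled[i]:
            j += 1
        rank_of[pooled[i]] = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        i = j
    all_ranks = [rank_of[v] for v in pooled]

    n1, n2 = len(x), len(y)
    expected = n1 * (n1 + n2 + 1) / 2  # mean rank sum of group x under H0
    observed = abs(sum(rank_of[v] for v in x) - expected)

    extreme = total = 0
    for combo in combinations(range(n1 + n2), n1):
        total += 1
        deviation = abs(sum(all_ranks[k] for k in combo) - expected)
        if deviation >= observed - 1e-12:
            extreme += 1
    return extreme / total


# Hypothetical, completely separated groups.
p_4_vs_4 = mann_whitney_exact([1, 2, 3, 4], [5, 6, 7, 8])    # 2/70
p_5_vs_3 = mann_whitney_exact([1, 2, 3, 4, 5], [6, 7, 8])    # 2/56
print(round(p_4_vs_4, 2), round(p_5_vs_3, 2))
```

Under these assumptions, 4 versus 4 assays give a floor of 2/70 and 5 versus 3 assays give 2/56, which round to the 0.03 and 0.04 seen in the table.
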
Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Leginon 3.4 


Data analysis For Cryo-EM data processing, the following software were used: Appion 3.0, MotionCor2, CTFFind 4, GCTF 0.56, Relion 2.1, CisTEM 1.0, 
CryoSPARC2, pyem 2019. 
For model building, the following software were used: Coot 0.8.9, Phenix 1.13, PYMOL 2.2, Chimera 1.13, Voss Volume Voxelator 1.0. 
For sequence alignment, the following software were used: OrthoMCL-DB 5, MUSCLE 3.8, ESPript 3. 
For data analysis (binding and transport studies), the following software was used: Prism 7 and 8; for data display, SigmaPlot 13 was used. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


All raw movie frames, micrographs, the particle stack and relevant metadata files have been deposited into EMPIAR, with accession code EMPIAR-10330. The 
electron density map has been deposited into EMDB, with accession code EMD-20806. The model has been deposited in the PDB, with accession code 6UKJ. All 
data are available in the manuscript or the supplementary materials. 
Data exclusions No data were excluded from the analyses. 


Replication Transport experiments were repeated at least 2 times, using 2 different batches of PfCRT-containing proteoliposomes. Binding studies were 
performed at least 3 times with protein (in nanodiscs) stemming from 2 individual preparations. Parasite data were generated on 2 to 5 
independent occasions. All parasite assays conducted were successful and were included in the final analysis. Since variations of the individual 
proteoliposome or nanodisc batches or cell culture preparations were within the standard experimental error, all replications were 
considered successful and reproducible and thus included in the analyses shown in the manuscript. 


Randomization Randomization was not applicable as the quality of the preparations (nanodiscs, proteoliposomes, cell cultures) could only be assessed by 
performing the respective experiments. 


Blinding Blinding was not applicable to the study as no research coordinator was instated to oversee the blinding process of the different experimental 
approaches performed in the different participating labs of this collaborative study. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 
Antibodies 


Antibodies used Rabbit anti-GFP antibody (Invitrogen), goat anti-rabbit IgG conjugated to HRP (Abcam), horseradish peroxidase (HRP)-conjugated 
anti-M13 monoclonal antibody (GE Healthcare), HRP-conjugated mouse anti-human IgG F(ab’)2 monoclonal antibody (Jackson) 


Validation Rabbit anti-GFP antibody (Invitrogen) has been validated for Western Blot with 316 publications by the manufacturer. Goat anti- 
rabbit IgG conjugated to HRP (Abcam) has been validated for Western Blot with 1335 publications by the manufacturer. 
Horseradish peroxidase (HRP)-conjugated anti-M13 monoclonal antibody (GE Healthcare) has been validated by the 
manufacturer for ELISA use. HRP-conjugated mouse anti-human IgG F(ab’)2 monoclonal antibody (Jackson) has been validated 
for Western Blot with 7 publications by the manufacturer. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) HEK-293 cells (ATCC), HEK-293 GnTi- cells (Invitrogen), Sf9 cells (Expression System), Plasmodium falciparum parasite lines 
(NIH) 


Authentication None of the cell lines has been authenticated. 


Mycoplasma contamination The cell lines were not tested for mycoplasma contamination. 


Commonly misidentified lines No commonly misidentified cell lines were used. 
(See ICLAC register) 
Host infection by pathogenic mycobacteria, such as Mycobacterium tuberculosis, is
facilitated by virulence factors that are secreted by type VII secretion systems. A
molecular understanding of the type VII secretion mechanism has been hampered
owing to a lack of three-dimensional structures of the fully assembled secretion
apparatus. Here we report the cryo-electron microscopy structure of a
membrane-embedded core complex of the ESX-3/type VII secretion system from
Mycobacterium smegmatis. The core of the ESX-3 secretion machine consists of four
protein components (EccB3, EccC3, EccD3 and EccE3, in a 1:1:2:1 stoichiometry), which
form two identical protomers. The EccC3 coupling protein comprises a flexible array of
four ATPase domains, which are linked to the membrane through a stalk domain. The
domain of unknown function (DUF) adjacent to the stalk is identified as an ATPase
domain that is essential for secretion. EccB3 is predominantly periplasmic, but a
small segment crosses the membrane and contacts the stalk domain. This suggests
that conformational changes in the stalk domain, triggered by substrate binding at
the distal end of EccC3 and subsequent ATP hydrolysis in the DUF, could be coupled
to substrate secretion to the periplasm. Our results reveal that the architecture of type
VII secretion systems differs markedly from that of other known secretion machines,
and provide a structural understanding of these systems that will be useful for the
design of antimicrobial strategies that target bacterial virulence.


Pathogenic mycobacteria contain up to five paralogous ESX/type VII
secretion systems. Three of these systems (ESX-1, ESX-3 and ESX-5)
mediate the secretion of specific sets of effector proteins that have
defined roles in tuberculosis. ESX-3 is expressed in response to
iron-limiting conditions and has been implicated in metal homeostasis,
inhibition of T-helper cell (CD4⁺) activation, phagosome maturation
and the repair of pathogen-induced phagosomal damage.

ESX secretion systems feature a set of five conserved core membrane
components (EccB, EccC, EccD, EccE and MycP, where Ecc denotes ESX
conserved component), which mediate the secretion of the EsxA:EsxB
family of virulence factors or of DNA. Biochemical and structural
studies of ESX-5 demonstrated that four of these components (EccB,
EccC, EccD and EccE) assemble into a stable hexameric secretion pore
in the cell envelope, whereas MycP, a membrane-anchored protease, is
not tightly associated with the stable core. The coupling protein
EccC recognizes effector proteins in the cytoplasm and energizes their
transport.

Recent work has provided initial insights into the structural features
of mycobacterial type VII secretion systems. Crystal structures of soluble
domains of EccB, EccC and EccD have been determined. A low-resolution
negative-stain electron microscopy (EM) structure of ESX-5
revealed that the type VII secretion system organizes into hexamers;
however, the use of staining agents limited the available information to
the external contour shape of the complex. Here we used single-particle
cryo-EM to reveal the structure of the membrane-embedded ESX-3/
type VII core complex from M. smegmatis, the core-complex-forming
components of which (EccB3–EccE3) share 40.4–74.7% sequence identity
with those of M. tuberculosis.

To gain insights into the molecular architecture of the type VII secre- 
tion system we expressed the ESX-3 gene cluster of M. smegmatis, which 
encodes the secreted substrates and cytosolic components alongside 
the components for the membrane-embedded secretion machinery 
(plasmid pMyNT:ESX-3; Extended Data Fig. 1a). A core complex of 
around 900 kDa was the most abundant species in terms of molecular 
mass in detergent-solubilized membrane extracts; subsequent purifi- 
cation revealed the ESX-3 core complex consisting of the membrane 
components EccB3, EccC3, EccD3 and EccE3 (Extended Data Fig. 1b, c). 
A minimal expression construct (pMyNT:Mini), encoding only the membrane
proteins, enabled the purification of this ESX-3 core complex in
high yields (Fig. 1a, b, right lanes, Extended Data Fig. 1a, d).

For cryo-EM, the sample was exchanged into the surfactant Amphipol
A8-35 and was further purified by size-exclusion chromatography.
In addition to the main peak at 900 kDa, we also observed larger
oligomeric species that were heterogeneous in shape and size and
thus were unsuitable for structure determination. However, a few
two-dimensional (2D) averages that were derived from these oligomers were
compatible with a complex of 25 nm in diameter, a size comparable
to that of the ESX-5 hexamer (Fig. 1a, b, Extended Data Fig. 1d–g).

By contrast, the peak at around 900 kDa arose from a homogeneous
complex. The 2D class averages show a central prominent density with
the distinctive shape of a membrane-spanning region. Densities on
opposite sides of the membrane, which we termed 'fork' and 'arms'
(Fig. 1c), appear blurred and indicate flexibility. Independent processing
of the fork revealed that this region is rigid, but has a flexible attachment
to the membrane (Fig. 1c, Extended Data Fig. 1h).

The structure of the 900-kDa ESX-3 core complex was determined to
a resolution of 3.8 Å, following a specific classification and refinement
strategy for each region of the map (Fig. 1d, Extended Data Figs. 1g,
h, 2–5). A merged reconstruction of the complex was built using one
possible orientation for the fork and one of the conformations for the
arms (Extended Data Figs. 1h, 3d). The composite map shows two similar
protomers interacting closely in a dimeric structure. A fork structure
composed of two identical monomers is connected to the membrane
(Fig. 1d). Some flexibility between the protomers was detected, and
therefore the structure of each single protomer was resolved independently
to improve resolution (Supplementary Tables 2, 3). The resolution
of protomer 1 reached 3.7 Å, improving the definition of several regions
in comparison with the structure of the dimeric core complex (Extended
Data Figs. 2, 3b, 4b).

Fig. 1 | Purification and cryo-EM structure of the ESX-3 core complex. a, The
size-exclusion chromatogram after exchange of the ESX-3 complex into the
surfactant Amphipol A8-35 revealed two peaks, P1 and P2 (Extended Data
Fig. 1d), which were analysed by BN-PAGE. The results are representative of
more than three independent experiments. b, SDS-PAGE analysis of peak 1 (P1)
and peak 2 (P2) of the ESX-3 core complex expressed in M. smegmatis from the
minimal expression construct pMyNT:Mini (encoding only the membrane
components of the system). The results are representative of more than three
independent experiments. c, Representative 2D class average of the 900-kDa
ESX-3 complex (peak 1). White arrows indicate flexible regions. Scale bar, 10 nm.
The top inset shows the 2D average of the fork obtained after subtraction from
the images. d, Cryo-EM density map of the ESX-3 core complex built after
merging the information of three processing strategies (Extended Data Fig. 2).
In the model, EccB3 was placed in one orientation with respect to the membrane
while maintaining continuity with the EccB3 TMHs inserted into the membrane.
The four core components are indicated in different colours: EccB3, green;
EccC3, blue and light blue for flexible regions; EccD3 dimer, monomer 1 (Mon 1)
and monomer 2 (Mon 2) in orange and red, respectively; EccE3, purple. Scale bar,
2 nm. Left, side view; middle, front view; right, top view. EccB3 was removed for
clarity. e, Schematic detailing the results of crosslinking experiments. The black
bars indicate predicted transmembrane helices. MD, membrane domain. The
domains of EccB3 are labelled R1, R2, C, R3 and R4.


On the basis of the cryo-EM density, which shows side-chain information
in most regions (Extended Data Fig. 6), and the results of crosslinking
mass spectrometry analysis (Fig. 1e), we modelled the structure for
most of ESX-3 (Fig. 2). The observed crosslinks indicated the proximity
of a small hydrophilic domain of EccD3 to both EccC3 and EccE3, but
no crosslinks to EccB3 were found. We therefore assigned the soluble
domains of EccC3, EccD3 and EccE3 to the large segment of density
comprising two cytoplasmic arms, whereas the soluble domains of
EccB3 reside in the smaller segment that forms the fork on the opposite
side of the membrane layer (Fig. 2a, b).

Two copies of an EccB3 homology model (comprising domains R1–R4
with a putative peptidoglycan binding fold and domain C) fit into the
fork (cross-correlation > 0.87) (Fig. 2b), which supports the previous
proposal that EccB appends the type VII secretion system to the
periplasmic peptidoglycan layer. The EccB3 dimer interface is formed by
domains R1, R2 and C. Density is observed connecting the periplasmic
domains to their respective transmembrane helices, which anchor each
of the two EccB3 monomers in the membrane (Fig. 2c).

Each protomer contains 25 transmembrane helices (TMHs), comprising
two copies of the EccD3 transmembrane domain (2 × 11 TMHs) for
each copy of EccB3 (1 TMH) and EccE3 (2 TMHs) (Fig. 2d). Two EccD3
monomers interact in an antiparallel orientation to provide the scaffold
for the ESX-3 protomer. In the membrane, TMH 11 of the two EccD3
monomers interacts with either EccB3 or EccE3. The EccD3 membrane
domains are connected to their respective N-terminal ubiquitin-like
(Ubl) domains through TMH 1 (Fig. 3a). In the cytoplasm, the Ubl domains
and their linkers to the membrane domains form a scaffold for the
cytoplasmic domains of EccC3 and EccE3.

Fig. 2 | Overview of the atomic structure of the ESX-3 core complex. a, The
rigid core of the ESX-3 dimer, excluding the flexible regions (the fork and tip of
the arms). b, Left, fit of a homology model of EccB3 into the 4.6 Å resolution
cryo-EM map. Top, side view; bottom, top view. Right, schematic of the EccB3
domain structure. c, Anchoring of EccB3 to the membrane of the ESX-3 dimer,
showing the TMH and the N-terminal helix (NTH). d, Top view of protomer 1
membrane region with the assignment of all transmembrane helices. The EccD3
dimers are shown in orange and red. The structure of protomer 1 was obtained
after particle subtraction (Extended Data Figs. 2, 3), and the density for EccC3
TMH, shown in blue, is slightly better defined than in the dimer.

Fig. 3 | Structure and function of the ESX-3/type VII secretion system. a, Side
view of the protomer 1 membrane region. b, Domain structure of EccC3, showing
the stalk domain and the interaction of the DUF domain with the two Ubl
domains of EccD3. c, The plasmid-encoded ESX-3 gene cluster (pMyNT:ESX-3)
mediates secretion of the substrate heterodimer EsxG-EsxH into the culture
filtrate of the transformed knockout strain M. smegmatis ΔESX-3. Left,
whole-cell lysate (WCL) and culture filtrate (CF) of M. smegmatis ΔESX-3 transformants
containing the following plasmids were analysed by western blot. Lane 1,
pMyNT:no insert; lane 2, pMyNT:ESX-3; lane 3, pMyNT:ESX-3-ΔEccE3; lane 4,
pMyNT:ESX-3-EccB3Δfork; lane 5, pMyNT:ESX-3-EccC3ΔDIII; lane 6,
pMyNT:ESX-3-EccC3-D319A; lane 7, pMyNT:ESX-3-EccC3-D320A; lane 8,
pMyNT:ESX-3-EccC3-R342T; lane 9, purified EsxG-EsxH. Loading and lysis control,
RNAP (RNA polymerase β subunit; around 150 kDa). Right, a magnified view of the
structure of the DUF domain, highlighting residues that are mutated in the secretion
assay. The western blots are representative of two biological replicates.
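
The helix count quoted for each protomer can be checked with a few lines of arithmetic. The Python sketch below simply multiplies the per-subunit helix numbers by the copy numbers stated in the text; the dictionary and function names are ours. Only the three subunits listed in the text's breakdown are counted, so the two partially resolved EccC3 helices are not part of the total.

```python
# Transmembrane helices (TMHs) per copy and copies per protomer, as stated in
# the text. EccC3 is omitted because the quoted total of 25 does not list it.
TMH_PER_COPY = {"EccD3": 11, "EccB3": 1, "EccE3": 2}
COPIES_PER_PROTOMER = {"EccD3": 2, "EccB3": 1, "EccE3": 1}


def tmh_count(n_protomers=1):
    """Total number of counted TMHs for a given number of protomers."""
    per_protomer = sum(TMH_PER_COPY[name] * COPIES_PER_PROTOMER[name]
                       for name in TMH_PER_COPY)
    return per_protomer * n_protomers


print(tmh_count(1))  # one protomer: 2 x 11 + 1 + 2
print(tmh_count(2))  # the dimeric core complex
```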

Two N-terminal TMHs insert EccC3 into the membrane. We detect
partial density for these EccC3 TMHs, which could not be modelled
completely (Figs. 2a, d, 3a, b, Extended Data Fig. 6b). We found that
a previously unassigned stalk domain connects the EccC3 cytosolic
domains to the membrane. The stalk comprises two α-helices and a
short antiparallel β-sheet and makes electrostatic interactions with the
EccB3 N-terminal helix, which runs almost parallel to the membrane
(Figs. 2c, 3a, b). This interaction suggests that conformational changes
of the stalk could be coupled to changes in EccB3. The stalk is followed
by a conserved domain that is designated in the literature as the domain of
unknown function (DUF). The DUF structure exhibits an ATPase fold,
which is in an ATP-free state.

The flexible arms in ESX-3 could be visualized only in reconstructions
that were obtained after re-centring the particles (Extended Data
Figs. 2b, 3, 5). This re-centring strategy was sufficient to fit a homology
model of the EccC3 DI ATPase domain within the density that is in direct
contact with the DUF domain (cross-correlation > 0.87) (Fig. 3b).
Classification of the particles revealed that the subsequent EccC3 ATPase
domains DII and DIII are extremely flexible, and only partial density for
the DII domain was detected.

Crosslinking experiments followed by mass spectrometry analysis
indicated that the soluble domains of EccC3, EccD3 and EccE3 are located
close to each other; the upper cytoplasmic arm was therefore assigned to
component EccE3, given its localization proximal to the two Ubl domains
of EccD3 and the crosslinks to these domains (Fig. 1e). EccE3 is appended
to the membrane by two TMHs and forms an extensive interface with
the EccD3 Ubl domain dimer in the cytoplasm (Figs. 2d, 3a). The core
of the EccE3 cytoplasmic domain shows a mixed α/β-fold with a central
antiparallel β-sheet surrounded by several α-helices.

Overall, the structure of the ESX-3 core complex comprises two identical
protomers, each consisting of one copy of EccB3, EccC3 and EccE3
and two copies of EccD3. The EccB3 dimer is crucial for holding both
protomers together, as the periplasmic domain of each EccB3 monomer
is linked to a TMH that is firmly anchored within the respective protomer
(Figs. 1d, 2a, c).

We performed several experiments to validate our structure
(Extended Data Fig. 7a–h). First, we confirmed that the purified
membrane-embedded secretion machine is competent for secretion, and
established an assay to monitor ESX-3-dependent secretion into the
culture medium using antiserum against the two effector proteins,
EsxG and EsxH. We used an M. smegmatis ESX-3 knockout strain that is
defective in secretion. Complementation of this strain with plasmid
pMyNT:ESX-3, which encodes the ESX-3 gene cluster (Extended Data
Fig. 7a), restored secretion (Fig. 3c, lanes 1, 2). Analysis of
detergent-solubilized cell envelopes of the transformed knockout strain by blue
native-polyacrylamide gel electrophoresis (BN-PAGE), western blot
and subsequent purification confirmed the assembly of the 900-kDa
ESX-3 core complex (Extended Data Fig. 7b, c).

We removed the periplasmic domain from EccB3 (using
pMyNT:ESX-3) and expressed it in secretion-deficient M. smegmatis
ΔESX-3. We then purified the EccB3Δfork-EccC3-EccD3-EccE3 complex
and found that the molecular mass was shifted towards that of
a single protomer (Extended Data Fig. 7h). The same plasmid did not
mediate the secretion of EsxG and EsxH, indicating that the periplasmic
domain of EccB3 stabilizes the dimer and is essential for secretion
(Fig. 3c, lane 4).

In order to investigate the function of EccE3, we removed its
corresponding gene (eccE3) from the plasmid-encoded ESX-3 gene cluster
(pMyNT:ESX-3) and expressed the remaining genes in M. smegmatis
ΔESX-3. Purification of the EccB3-EccC3-EccD3 subcomplex revealed
that it assembled into a dimer, although it was prone to dissociation
(Extended Data Fig. 7h). This plasmid did not mediate secretion
(Fig. 3c, lane 3), indicating that EccE3 is a scaffold component of the
ESX-3 protomer. We conclude that the stability of the ESX-3 protomers
is critical for the function of the ESX-3 secretion system.

Fig. 4 | Model of the oligomeric ESX-3 secretion machine. a, Side and top
views of the model for the ESX-3 oligomer. A vestibule-like structure is formed
by the five EccC3 domains: stalk, DUF, DI, DII and DIII. b, Inner membrane pore
and vestibule structure of the ESX-3 oligomer. The inner membrane pore is
composed of EccB3 and EccC3 TMHs and the stalk domains of EccC3. c, Model
for the mechanism of secretion in the ESX-3/type VII secretion system. The
membrane pore is formed by EccC3, and contacts between EccC3 and EccB3
facilitate the coupling of ATP hydrolysis and substrate binding to changes in
conformation in both proteins and the opening of the pore.


We also investigated the function of EccC3. Deletion of the DIII domain
of EccC3 completely abolished secretion (Fig. 3c, lane 5), in agreement
with observations in the staphylococcal type VIIb secretion system.
The DUF domain contains a canonical and conserved Walker B motif,
whereas the Walker A motif (336-GXXXXHRT-343) includes non-canonical
residues in positions 341 (H for G) and 342 (R for K). We examined
the effect of mutations in these Walker A and B motifs (Fig. 3c). D319A
and D320A mutations abolished secretion, confirming that the DUF
is an ATPase domain with an essential function in secretion, whereas
the R342T mutation in the non-canonical Walker A motif had no effect
(Fig. 3c, lanes 6–8). The domain architecture of type VII coupling proteins
thus comprises a stalk domain that connects a linear array of four ATPase
domains (DUF, DI, DII and DIII) to the membrane (Fig. 3b). Three out of
the four ATPase domains are identified as the main constituents of the
arms in the electron microscopy density (Figs. 2a, 3a, b); the density for
DIII is missing in the reconstruction owing to its high degree of flexibility.
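
The comparison of the DUF Walker A motif with the canonical pattern can be made explicit. The Python sketch below aligns the motif given in the text (336-GXXXXHRT-343) against the widely used Walker A consensus GXXXXGK[S/T] and reports the positions that deviate. The function and constant names are ours, and the consensus is written out here as an assumption about what "canonical" refers to.

```python
# Canonical Walker A (P-loop) consensus: G-X-X-X-X-G-K-[S/T].
# "X" means any residue; "ST" means serine or threonine is accepted.
CANONICAL_WALKER_A = ["G", "X", "X", "X", "X", "G", "K", "ST"]


def walker_a_deviations(motif, start):
    """Return {position: (observed, expected)} for non-canonical residues.

    `motif` is an 8-residue string; `start` is the residue number of its
    first position.
    """
    deviations = {}
    for offset, (observed, allowed) in enumerate(zip(motif, CANONICAL_WALKER_A)):
        if allowed == "X":
            continue  # any residue is acceptable at this position
        if observed not in allowed:
            deviations[start + offset] = (observed, allowed)
    return deviations


# Motif and numbering as quoted in the text for the EccC3 DUF domain.
print(walker_a_deviations("GXXXXHRT", 336))
# {341: ('H', 'G'), 342: ('R', 'K')}
```

The two reported deviations match the positions named in the text; the threonine at position 343 is compatible with the consensus.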

Unlike the hexameric ESX-5, we extracted ESX-3 in a dimeric state. We
tested whether we could detect larger oligomers using our 'high yield'
minimal expression construct (pMyNT:Mini, EccD3Strep). BN-PAGE of
detergent-solubilized membrane extracts revealed species with a higher
molecular mass than that of the dimers (Extended Data Fig. 7i). These
species increase in amount when a crosslinking agent is used, suggesting
that ESX-3 forms larger complexes in the membranes that are disrupted
during our extraction protocol. As a control, large oligomers are not
found in the membranes of mycobacteria that express only EccD3.

The cryo-EM structure of the ESX-3 core complex revealed that EccB3,
EccC3, EccD3 and EccE3 are present in a 1:1:2:1 stoichiometry. This is
distinct from the 1:1:1:1 ratio proposed for ESX-5, which was measured
by intensity-based absolute quantification mass spectrometry (iBAQ
MS). This method was not sufficiently sensitive to determine the
stoichiometry of ESX-3 (Extended Data Fig. 7f). Notably, the ratio of subunits
in ESX-3 extracted from secretion-competent cells, as estimated by
Oriole staining, agrees with the 1:1:2:1 stoichiometry obtained from the
cryo-EM structure of ESX-3 (Extended Data Fig. 7e). Furthermore,
size-exclusion chromatography experiments found that EccD3 alone elutes
as a dimer of approximately 100 kDa (2 × 48 kDa) (Extended Data Fig. 7g).

Using the low-resolution map of ESX-5 as a template, three ESX-3
dimers can be fitted by following the contour shape provided by ESX-5
to model the ESX-3 hexamer (Fig. 4a, b, Extended Data Fig. 8). The
interior of the membrane pore has a diameter of around 25 Å and is formed
by two transmembrane helices of EccC3, for which we observe partial
density (Extended Data Fig. 6b). The secreted EsxG-EsxH heterodimer
of the ESX-3 system, as well as the known substrate structures of other
ESX systems, have a diameter of 22 Å. The secretion pore in our model
is therefore sufficiently wide for the translocation of folded substrates.
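
The bookkeeping behind the hexamer model is simple enough to write down. The Python sketch below scales the 1:1:2:1 protomer stoichiometry to the modelled assembly of three dimers (six protomers) and compares the pore and substrate diameters quoted above. The names are ours, and the diameters are the approximate values from the text, so the clearance is indicative only.

```python
# Subunit copies in one protomer (EccB3:EccC3:EccD3:EccE3 = 1:1:2:1).
PROTOMER = {"EccB3": 1, "EccC3": 1, "EccD3": 2, "EccE3": 1}
PROTOMERS_PER_DIMER = 2
DIMERS_IN_HEXAMER_MODEL = 3

# Approximate diameters quoted in the text, in angstroms.
PORE_DIAMETER = 25.0
SUBSTRATE_DIAMETER = 22.0


def subunit_copies(n_protomers):
    """Copies of each subunit in an assembly of n_protomers protomers."""
    return {name: copies * n_protomers for name, copies in PROTOMER.items()}


hexamer = subunit_copies(PROTOMERS_PER_DIMER * DIMERS_IN_HEXAMER_MODEL)
clearance = PORE_DIAMETER - SUBSTRATE_DIAMETER

print(hexamer)    # six protomers in total
print(clearance)  # spare diameter left for a folded substrate
```
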
On the cytoplasmic side of the proposed secretion pore, six copies of
the coupling component EccC3 pack into a unique vestibule-like structure
that consists of a stalk domain and four ATPase domains of EccC3
(DUF, DI, DII and DIII). The architecture of the DUF and stalk domains
linked to a membrane-traversing helix that connects to a periplasmic
domain is reminiscent of ABC (subfamily F) transporters (Extended
Data Fig. 9). The DI, DII and DIII domains are flexible, and this
flexibility increases towards the outermost layer of the vestibule, which
is composed of six DIII domains. Structure-function studies on the
homologous EccC coupling protein from Thermomonospora curvata
showed that the ATPase activity of DI is modulated by its interaction
with domain DII, and that peptides representing the type VII secretion
signal bind to domain DIII.

The extensive contacts between the stalk domain and the two N-terminal
helices in EccB3 suggest that conformational changes that are triggered
by substrate binding to the DIII domain and subsequent ATP hydrolysis
of the DUF could be directly transmitted to open or close the membrane
pore as well as the periplasmic exit formed by EccB3 (Fig. 4c).

In summary, the detailed architecture of a type VII secretion system
reveals the oligomerization of a building block comprising two tightly
interconnected protomers to form a secretion pore, which is sufficient
in size to enable the translocation of substrate heterodimers in their
folded state (Extended Data Fig. 10). Classical antibiotics function
by inhibiting essential aspects of prokaryotic biology. Pathogenic
bacteria secrete virulence factors that help them to infect the host
and evade the actions of the immune system. Targeting the systems
responsible for secretion could provide an alternative antimicrobial
strategy. Here we provide a structural and functional understanding
of the architecture of type VII secretion systems, which will aid the
design of strategies that target the secretion of virulence factors in
Mycobacterium and in other bacteria for which pathogenicity depends
on type VII secretion systems.
METHODS 


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 


Molecular biology 

For the generation of the ESX-3 expression plasmids used in this study, 
fragments of the ESX-3 gene cluster of M. smegmatis mc7155 (eccA3 to 
eccE3) were amplified by PCR (CloneAmp HiFi PCR premix, Takara), 
combined and cloned into the vector pMyNT using restriction-free liga- 
tion (In-Fusion HD cloning kit, Takara). The following In-Fusion primers 
were used for amplification (Supplementary Table 1): (i) X390/X391
(eccA3-eccC3); (ii) X392/X391 (eccB3-eccC3); (iii) X396/X394 (pe5-eccE3);
(iiib) X396/X399 (pe5-MycP3); (iv) X393/X394 (espG3-eccE3);
(v) X395/X394 (eccD3-eccE3); (vi) X390/NX391 (eccA3-eccC3ΔD3). A
DNA fragment encoding a C-terminal His-Tag (-GSMGGSHHHHHH*) 
was introduced to eccC3 using oligomers X388/X389 (vii). The backbone 
of the pMyNT vector was amplified using the primer pairs X386/X387 
(viii) or M37/M38 (ix). In order to clone the ESX-3 gene cluster encoding 
a His-Tag on the 3’ end of eccC3, fragments (i) and (vii) were inserted 
into the pMyNT backbone (viii) first. The resulting construct was then 
linearized using the restriction enzyme SnaBI followed by insertion of
fragment (iii) to produce plasmid pMyNT:ESX-3 or fragment (iiib) to
produce the pMyNT:ESX-3ΔEccE3 variant. For the generation of plasmid
pMyNT:ESX-3i, we excluded the acetamidase promoter region of
pMyNT:ESX-3 using primers X430/X426 and replaced it with the IdeR
promoter amplified with primers X427/X428. 

To generate construct pMyNT:ESX-3ΔS (lacking all substrate genes of
the ESX-3 gene cluster), fragments (i) and (vii) were cloned into the pMyNT
backbone (viii). The resulting construct was digested with SnaBI followed
by ligation of fragment (iv). Plasmid pMyNT:ESX-3ΔSC was produced by
insertion of fragments (ii) and (vii) into the pMyNT backbone (viii). After 
digestion by SnaBI, fragment (v) was cloned into the construct. 

To generate plasmid pMyNT:Mini, the multi-cloning site of the pMyNT 
vector, downstream of the acetamidase ribosome-binding site, was 
replaced with the oligonucleotide pair X397/X398. The genes eccB3, 
mycP3 and eccE3 were amplified from the genomic DNA of M. smegmatis
mc²155 using In-Fusion primers X266/X267, X307/X308 and X275/X276.
Gene eccC3 encoding a 3’-terminal His-tag (-GSMGGSHHHHHH*) was 
amplified using In-Fusion primers X311/X312 and eccD3 encoding an 
N-terminal Strep-tag II (MASWSHPQFEKGS-) was amplified using In-Fusion
primers X274/X273. Genes were cloned via the following restriction
sites: EcoRV for eccB3, HindIII for eccC3, SnaBI for eccD3 (N-terminal
Strep-tag II), ScaI for mycP3 and HpaI for eccE3.

To generate plasmid pMyNT:StrepII-EccD3, the multi-cloning site of
the pMyNT vector was removed by inverse PCR using primers X42/X43.
Oligonucleotides (X44/X44C) encoding the Strep-tag II and the BamHI
restriction site were annealed and then inserted into the linearized vec- 
tor by ligation. Gene eccD3 was amplified with In-Fusion primers X96/ 
X97 and cloned into the pMyNT plasmid via the BamHI restriction site. 

To generate construct pMyNT:ESX-3-EccC3ΔDIII (lacking the DIII
domain of EccC3), fragments (vi) and (vii) were cloned into the pMyNT 
backbone (viii). The resulting construct was digested with SnaBI followed 
by ligation of fragment (iii). 

The plasmid EccB3Δfork was produced by amplifying the following
fragments using pMyNT:ESX-3 as template: NX1/NX2 (pMyNT-eccA3-eccB3Δfork),
NX3/NX4 (eccB3Δfork-eccE3), NX5/NX6 (eccE3-pMyNT).

For generation of plasmids pMyNT:ESX-3-EccC3-D319A, pMyNT:ESX- 
3-EccC3-D320A and pMyNT:ESX-3-EccC3-R342T—containing point 
mutations in the Walker A or B motifs of the DUF of EccC3—the vec- 
tor pMyNT was linearized by PCR first using primer pair M37/M38 (ix). 
Fragments (x) (eccA3-eccC3) and (xi) (eccC3-eccE3) were amplified 
from pMyNT:ESX-3 using In-Fusion primers M39/M40 or M47/M48,
respectively. Oligonucleotides encoding parts of the eccC3 gene com-
prising point mutations D319A (M41/M42), D320A (M43/M44) or R342T
(M45/M46) were annealed and ligated together with fragment (x) into
the pMyNT backbone (ix). The resulting constructs were digested with
SnaBI followed by ligation of fragment (xi).


Protein overexpression and purification of the ESX-3 dimer and 
the ESX-3 higher oligomer 

M. smegmatis mc²155 was transformed with the respective plasmid
and cultured in LB (Luria/Miller) medium supplemented with 0.05%
(v/v) Tween 80 and 0.2% (v/v) glycerol at 150 r.p.m. Induction was per-
formed with 0.2% (w/v) acetamide (or 200 µM 2,2’-dipyridyl in the case
of pMyNT:ESX-3i) at an optical density at 600 nm (OD600 nm) of 0.5 and
cells were grown until a final optical density of 1.4-1.6. Cells were pel-
leted by centrifugation and washed in 1× PBS buffer. The cell pellet was
resuspended in buffer A (30 mM HEPES pH 8.0, 300 mM NaCl and 10% 
glycerol) supplemented with EDTA-free protease inhibitors (Roche). 
Cells were lysed by three passages through an Emulsiflex-C3 homog- 
enizer (Avestin). Unlysed cells were removed by centrifugation (10 min, 
10,000g). The membrane fraction was separated by ultracentrifugation 
(1 h, 100,000g). Membranes were solubilized in buffer A supplemented
with 0.5% TeDM (Anatrace) at 4 °C for 1 h. Insoluble material was removed
by ultracentrifugation (1 h, 100,000g).

The sample was supplemented with 30 mM imidazole and loaded on a
HisTrap column (GE Healthcare). For wash and elution, buffer A supple-
mented with 0.0022% TeDM was used with 30 mM and 250 mM imidazole,
respectively. TeDM was exchanged for Amphipol A8-35 at a 1:3 ratio of 
protein to amphipol. TeDM was removed by incubation with BioBeads 
(Bio-Rad) overnight. In order to separate the different oligomeric states 
of the complex, size-exclusion chromatography was performed using a 
Superose 6 Increase 10/300GL column (GE Healthcare). Fractions were 
analysed by SDS-PAGE and BN-PAGE followed by Colloidal Blue staining. 


Protein expression and purification of the EccD3 dimer 

M. smegmatis mc²155 was transformed with the pMyNT:Strep-EccD3
plasmid and cultured in LB medium supplemented with 0.05% (v/v) Tween 80
and 0.2% (v/v) glycerol at 150 r.p.m. Induction was performed with 0.2%
(w/v) acetamide at an OD600 nm of 0.5 and cells were grown until a final
optical density of 1.4-1.6 was reached. Cells were pelleted by centrifu-
gation and washed in 1× PBS buffer. The cell pellet was resuspended in
buffer A (50 mM Tris pH 8.0, 300 mM NaCl). Cells were lysed by three
passages through an Emulsiflex-C3 homogenizer (Avestin). Unlysed
cells were removed by centrifugation (10 min, 10,000g). The membrane
fraction was separated by ultracentrifugation (1 h, 100,000g). Mem-
branes were solubilized in buffer A supplemented with 1% n-dodecyl-
β-D-maltopyranoside (DDM; Anatrace) at 4 °C for 1 h. Insoluble material
was removed by ultracentrifugation (1 h, 100,000g).

The sample was loaded on a StrepTrap column (GE Healthcare). For
wash and elution, buffer A was supplemented with 0.05% DDM and with 
0.05% DDM and 2.5 mM D-desthiobiotin, respectively. The eluted protein 
was subjected to size-exclusion chromatography using a Superose 6 
Increase 10/300GL column (GE Healthcare). 


Generation of polyclonal antiserum against the substrates EsxG 
and EsxH 

The EsxG-EsxH protein complex of M. smegmatis was expressed from 
plasmid pMAPLe3, which was provided by the Eisenberg laboratory 
(University of California, Los Angeles), and purified to homogeneity 
as described previously. Purified EsxG-EsxH was used to immunize
rabbits (immunoGlobe). The polyclonal antiserum was subjected to 
affinity purification using a column coupled to the purified antigen. 


Protein secretion assay 
M. smegmatis strains were grown in 7H9-Tween medium at 37 °C to an
OD600 nm of 0.8. For the analysis of whole-cell lysates, cells were pelleted by
centrifugation (5,000g, 10 min), washed with PBS buffer and disrupted
by bead-beating with 0.1 mm glass beads (Sigma). Samples were sup-
plemented with Novex Tricine SDS Sample Buffer (Invitrogen), boiled for
2 min and then separated on a 10-20% Novex Tricine gel for analysis by
western blotting. Culture supernatants were centrifuged twice (5,000g,
4 °C, 10 min) and passed through a 0.2-µm filter to remove cells that were
not lysed. Proteins were precipitated by the addition of 10% trichloro-
acetic acid (overnight, 4 °C) followed by centrifugation (10,000g, 4 °C,
15 min). The precipitate was washed with ice-cold acetone, resuspended
in Novex Tricine SDS Sample Buffer, boiled for 2 min and then separated
on a 10-20% Novex Tricine gel. The amount of whole-cell lysate loaded
corresponds to 0.26 OD600 nm units of cells, whereas the culture superna-
tant loaded corresponds to 7.5 × 0.26 OD600 nm units of cells. Subsequent
western blots were carried out according to the manufacturer’s proto-
col (Invitrogen). Polyvinylidene fluoride membranes were incubated
with either polyclonal rabbit serum against the EsxG-EsxH protein
complex (approximately 4.5 µg ml⁻¹) or a monoclonal mouse antibody
against the RNA polymerase β subunit (1:2,000; BioLegend). Secondary
horseradish peroxidase (HRP)-conjugated antibodies were used at the
following concentrations: anti-rabbit IgG-HRP (1:10,000; Carl Roth),
and anti-mouse IgG-HRP (1:10,000; Carl Roth). Proteins were visualized
using the Enhanced Chemiluminescence substrate kit (Pierce) and a
FujiFilm LAS-3000 luminescent image analyser. Band intensities were
quantified using the ImageJ software.
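Because lysate and supernatant lanes carry different amounts of cell equivalents (0.26 and 7.5 × 0.26 OD600 nm units, respectively), band intensities need to be scaled by the loaded amount before they can be compared. The text does not state how this scaling was done; the following Python sketch shows one plausible normalization, with hypothetical function names and illustrative intensities only.

```python
# Loading amounts stated in the text (OD600 units of cells per lane).
LYSATE_OD_UNITS = 0.26
SUPERNATANT_OD_UNITS = 7.5 * 0.26  # supernatant lane carries 7.5-fold more cell equivalents


def per_od_signal(band_intensity: float, od_units_loaded: float) -> float:
    """Band intensity per OD600 unit of cells loaded on the gel."""
    if od_units_loaded <= 0:
        raise ValueError("od_units_loaded must be positive")
    return band_intensity / od_units_loaded


def secreted_fraction(lysate_intensity: float, supernatant_intensity: float) -> float:
    """Fraction of the per-cell signal found in the supernatant.

    Assumption (not from the text): band intensity scales linearly with
    the amount of protein in the lane.
    """
    cell = per_od_signal(lysate_intensity, LYSATE_OD_UNITS)
    sup = per_od_signal(supernatant_intensity, SUPERNATANT_OD_UNITS)
    return sup / (cell + sup)
```

With these loading amounts, a supernatant band 7.5 times as intense as the lysate band corresponds to equal per-cell amounts in both fractions.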


Cryo-EM sample preparation 

The sample was concentrated to 0.3 mg ml⁻¹ and a volume of 3.5 µl
was applied on glow-discharged holey carbon grids (Quantifoil Cu/Rh
R1.2/1.3 400 mesh). Excess liquid was removed by blotting for 3 s (blot
force −10) using filter paper followed by plunge freezing in liquid ethane
using a FEI Vitrobot Mark IV at 100% humidity at 4 °C. 


Cryo-EM data acquisition and image processing of larger 
complexes 

For the analysis of the large complexes eluting in the void volume of 
the size-exclusion chromatography experiment (Extended Data Fig. 1), 
images were collected on a 200-kV FEI Talos Arctica electron micro-
scope equipped with a Falcon III direct electron detector at the Spanish
National Centre for Biotechnology (CNB-CSIC, Madrid). Images were
recorded using the integrative mode and a nominal magnification of
75,000×, corresponding to 1.42 Å per pixel at the specimen level. A total
dose of 50 electrons per Å² was used, fractionated in 50 frames over a
2.98-s exposure. Images were collected with a nominal defocus range
of 1.4-3.2 µm.
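The acquisition parameters above imply a per-frame dose, a dose rate and a Nyquist limit that are useful sanity checks. The short Python sketch below derives them from the stated values; the function names are hypothetical and are not part of any software used in this study.

```python
def dose_per_frame(total_dose: float, n_frames: int) -> float:
    """Electron dose per frame (e- per square angstrom)."""
    if n_frames <= 0:
        raise ValueError("n_frames must be positive")
    return total_dose / n_frames


def dose_rate(total_dose: float, exposure_s: float) -> float:
    """Electron dose rate (e- per square angstrom per second)."""
    if exposure_s <= 0:
        raise ValueError("exposure_s must be positive")
    return total_dose / exposure_s


def nyquist_limit(pixel_size_angstrom: float) -> float:
    """Best resolution (in angstrom) that a given pixel size can represent."""
    return 2.0 * pixel_size_angstrom


# Values stated in the text for the Talos Arctica dataset.
frame_dose = dose_per_frame(50.0, 50)  # dose in each of the 50 frames
rate = dose_rate(50.0, 2.98)           # dose delivered per second
limit = nyquist_limit(1.42)            # twice the pixel size
```

For this dataset the Nyquist limit is twice the 1.42 Å pixel size, well beyond the resolution needed for the 2D analysis of the larger complexes.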

Video stacks (3,387) were corrected for drift (5 × 5 patches) and dose-
weighted using MotionCor2. The contrast transfer function parameters
were determined for the drift-corrected micrographs using Gctf. A first
set of 2D references was generated from manually picked particles in
Relion and these were then used for subsequent automatic particle
picking using Gautomatch (provided by K. Zhang, Yale University). The
automatically selected particles (241,990) were extracted
and thoroughly analysed using 2D-classification routines available in
Relion and cryoSPARC. A subset of 36,055 particles, corresponding to
side views of the multimeric complex, was aligned using a 2D reference
and subsequently classified in 2D using a mask to focus classification
on the density outside the membrane region. The dimensions of a rep-
resentative average with density at both sides of the membrane region
were measured. 


Cryo-EM data acquisition of dimeric complexes 

An initial dataset of the purified ESX-3 complex was collected at The
Netherlands Center for Electron Nanoscopy. Processing of these data
reached 7 Å resolution for the core of the dimer (Extended Data Fig. 2a).
For the high-resolution structure, micrographs were collected at the
Rudolf Virchow Center, Universität Würzburg using a 300 kV FEI Titan
Krios electron microscope (Extended Data Fig. 2b). Micrographs were
recorded using a Falcon III direct electron detector in counting mode 
(Supplementary Table 2). 

Videos (11,903) were recorded with an underfocus of 1.6 to 2.6 µm, a
calibrated magnification corresponding to 1.0635 Å per pixel, a dose of
1 electron per Å² per frame, accumulating a total dose of 50 electrons
per Å² fractionated in 55 frames.


High-resolution cryo-EM data processing 

The dataset was processed making use of tools available in Relion ver-
sions 2.1 and 3.0, cryoSPARC, cisTEM and Scipion. Initially, videos
coming from the different sessions were aligned and local motion was
corrected using MotionCor2 with dose weighting. Micrographs that
exhibited defects in the Thon rings due to excessive drift, ice contami-
nation or astigmatism were discarded. Details can be found in Sup-
plementary Table 2.

Particles (2,066,007) were automatically selected and submitted 
to several rounds of 2D-classification in order to discard bad particles 
using Relion (Extended Data Fig. 2b). After cleaning, 325,205 particles 
were used in all subsequent analyses. Next, an initial three-dimensional 
(3D) model was generated from the selected particles using routines 
available in Relion and this volume was used to further classify each of the
datasets in 3D. After classification 165,412 particles were refined using
Relion to generate a cryo-EM map with an estimated average resolu-
tion of 3.8 Å, using gold-standard refinement methods and the Fourier
shell correlation (FSC) cut-off of 0.143 (Extended Data Fig. 4). Local
resolution ranges were analysed within Relion. Refinement of one of
the monomers after particle subtraction to remove the influence of the
other monomer during refinement yielded 3.7 Å and further improved
the resolution of some regions. 
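The resolution estimates quoted here rest on the point at which the FSC curve between two independently refined half-maps falls below 0.143. As an illustration of that criterion only (Relion reports this value itself, and the helper below is a hypothetical sketch rather than its implementation), the crossing can be located by linear interpolation between neighbouring Fourier shells:

```python
from typing import Optional, Sequence


def fsc_resolution(
    spatial_freq: Sequence[float],
    fsc: Sequence[float],
    threshold: float = 0.143,
) -> Optional[float]:
    """Resolution (in angstrom) where the FSC first drops below the threshold.

    spatial_freq holds shell frequencies in 1/angstrom, in increasing order.
    Returns None if the curve never crosses the threshold.
    """
    if len(spatial_freq) != len(fsc):
        raise ValueError("spatial_freq and fsc must have the same length")
    for i in range(1, len(fsc)):
        above, below = fsc[i - 1], fsc[i]
        if above >= threshold > below:
            # Linear interpolation between the two bracketing shells.
            frac = (above - threshold) / (above - below)
            freq = spatial_freq[i - 1] + frac * (spatial_freq[i] - spatial_freq[i - 1])
            return 1.0 / freq
    return None
```

The input curve is assumed to come from two half-maps refined independently, as in the gold-standard procedure described above.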

The cryo-EM map of the ESX-3 complex exhibited a lack of detail at 
both ends of the complex (tip of the arms and the fork), due in part to
flexibility but possibly also to the increasing distance from the centre
of the box. It has been observed previously that the accuracy of the
alignments improves at the centre of rotation compared to more distal
regions, especially in flexible and large multi-component macromolecu-
lar complexes. We applied a re-centring strategy in which the group of
325,205 particles previously selected was re-extracted and centred in 
either of the two distal regions of the map (Extended Data Fig. 2b), as 
detailed below. For the distal arm region, the selected particles centred 
at this region were classified using a mask that accounted for only the 
variability of the density corresponding to the most distal region of 
the molecule, and this classification was performed without further 
alignment of the particles. These two classes included 80,173 and 69,919 
particles each, and they were further processed using gold-standard 
refinement methods in Relion to generate cryo-EM maps at 5.4 Å and 5.3 Å
resolution (gold-standard FSC = 0.143 criterion) (Extended Data Fig. 5). 

Similarly, particles were subsequently re-extracted and centred in the
fork region (Extended Data Fig. 2b). Particles were then subtracted in
order to remove the density that did not correspond to the fork. These
particles were analysed using the ab initio routine available in Relion,
and without assuming any symmetry. One of the maps, corresponding to
55,342 particles, exhibited features for the fork already observed in the
2D averages and preliminary processing of the data, and it was selected
for further refinement using Relion and applying C2 symmetry. The
map showed an estimated average resolution of 4.6 Å (gold-standard
FSC = 0.143 criterion) (Extended Data Fig. 4). Differences in resolution
across the reconstructed map were analysed using the local resolution 
routine available in Relion. 


Model building and refinement 

The programs RaptorX, I-Tasser or Phyre2 were used to generate
homology models of the periplasmic domain of EccB3 (amino acids (aa)
101-518), the DUF (aa 201-400), the three ATPase domains DI-DIII (aa
401-1325) and the Ubl domain of EccD3 (aa 6-97) based on crystal structures of
the periplasmic domain of EccB1 from M. tuberculosis (PDB ID: 3X3M),
the DUF (PDB ID: 2IUUA), the Ubl domain of EccD1 from M. tuberculosis
(PDB ID: 4KV2) and the three ATPase domains of EccC from T. curvata
(PDB ID: 4NOH) as templates, respectively. As a first step, rigid-body
fitting of these models into the cryo-EM density was performed using
Chimera. To improve the fit of the DUF and the Ubl domains, five cycles
of morphing were carried out using Phenix Refine v1.12 before parts
of the structures were rebuilt using Coot. De novo models of the cyto-
plasmic segment and the transmembrane region of EccB3 (aa 9-89), the
N-terminal stalk domain of EccC3 (aa 2-44 and 89-401), the transmem-
brane domain of EccD3 (aa 98-471) and EccE3 (aa 2-285) were built guided
by Quick2D and Haruspex. EccC3 DI is located at the flexible arms
and the resolution in one of the structures obtained after re-centring 
of the particles indicated clear secondary-structure elements and was 
sufficient to fit a homology model with high cross-correlation (>0.87). 

For EccC3 domain DII, density is present only in protomer 1 without
information on the secondary structure. A homology model of DII was
fitted into this density as a rigid body using Chimera. Density for DIII (aa
1052-1325) was missing in both protomers. EccC3 contains 2 TMHs for
which partial density was detected in the map of the ESX-3 monomer. For
TMH1, aa 36-45 of the helix were built, whereas no density was present
for aa 46-62. A polyalanine model of TMH2 (aa 72-90) was placed into
the density and rigid-body fitting and morphing were carried out using 
Phenix Refine v1.12 (Extended Data Fig. 6b). 

Model building and structure refinement were performed iteratively 
and using the information from the map of the dimer and the monomer. 
Five macro cycles of real-space and grouped B factor refinement of main 
and side chains of the entire model of the ESX-3 core complex were 
performed in Phenix Refine v1.12 at 3.7 Å resolution using restraints for
secondary structure, rotamers as well as Ramachandran angles. Statis- 
tics for the model can be found in Supplementary Tables 2, 3. 

To model the ESX-3 oligomer, a composite EM map of the 
EccB3:EccC3:EccD3:EccE3 complex was chosen, in which the EccB3 
dimer was rotated by 10° along the two-fold symmetry axis with respect 
to the membrane layer permitting the fit of all three copies of this com-
posite map into the ESX-5 EM map without clashes (EMDB-3596). To
model the ATPase domains DII and DIII, the crystal structure of DI-DIII
(PDB ID: 4NOH) was superimposed onto DI in the ESX-3 structure placing
DII into the density adjacent to DI.


Crosslinking mass spectrometry of ESX-3 

The purified ESX-3 core complex was crosslinked using the N-hydrox- 
ysuccinimide (NHS) ester disuccinimidyl dibutyric urea (DSBU, also 
known as BuUrBu). The crosslinking reactions were incubated for 45 min
at room temperature at a final excess of 100-, 50- and 25-fold of DSBU
with respect to the protein concentration. The reactions were quenched
by adding NH₄HCO₃ to a final concentration of 50 mM and incubating
for 15 min. 
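The crosslinker amounts are given as a fold excess over the protein concentration. Assuming this refers to a molar excess (the text does not say so explicitly), the DSBU concentration for each reaction follows from the protein concentration and molecular mass. The Python sketch below illustrates the conversion; the function name and the example protein concentration are hypothetical, and the 900-kDa mass is the value quoted for the core complex in the Extended Data legends.

```python
def crosslinker_concentration_um(
    protein_mg_per_ml: float,
    protein_mass_kda: float,
    fold_excess: float,
) -> float:
    """Crosslinker concentration (µM) for a given molar excess over protein.

    mg/ml divided by kDa gives mmol/l, hence the factor of 1,000 to reach µM.
    """
    if protein_mass_kda <= 0:
        raise ValueError("protein_mass_kda must be positive")
    protein_um = protein_mg_per_ml / protein_mass_kda * 1000.0
    return protein_um * fold_excess


# Hypothetical example: 0.9 mg/ml of a 900-kDa complex is 1 µM protein.
dsbu_um = {fold: crosslinker_concentration_um(0.9, 900.0, fold) for fold in (100, 50, 25)}
```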

The crosslinked samples were freeze-dried and resuspended in 50 mM
NH₄HCO₃, reduced with 10 mM dithiothreitol and alkylated with 50 mM
iodoacetamide. After alkylation, proteins were digested with trypsin
(Promega), at an enzyme-to-substrate ratio of 1:20, overnight at 37 °C.
The samples were acidified with formic acid to a final concentration of
2% (v/v) and the peptides fractionated by peptide size-exclusion chro-
matography, using a Superdex Peptide 3.2/300 column (GE Healthcare)
with 30% (v/v) acetonitrile/0.1% (v/v) trifluoroacetic acid as mobile
phase at a flow rate of 50 µl min⁻¹. Fractions were collected every 2 min
with elution volumes of 1.0 ml to 1.7 ml, lyophilized and resuspended 
in 2% (v/v) acetonitrile and 2% (v/v) formic acid. 

Fractions were analysed by nanoscale capillary liquid chromatography-
tandem mass spectrometry (LC-MS/MS) using an Ultimate U3000 HPLC
(Thermo Scientific Dionex) to deliver a flow of approximately 300 nl
min⁻¹. A C18 Acclaim PepMap100 5 µm, 100 µm × 20 mm nanoViper
(Thermo Scientific Dionex), trapped the peptides before separation on
a C18 Acclaim PepMap100 3 µm, 75 µm × 250 mm nanoViper (Thermo
Scientific Dionex). Peptides were eluted with a gradient of acetonitrile.
The analytical column outlet was directly interfaced via a nanoflow
electrospray ionization source, with a hybrid quadrupole Orbitrap
mass spectrometer (Q-Exactive HF-X, Thermo Scientific). MS data
were acquired in data-dependent mode. High-resolution (R) full scans
(R = 120,000, m/z 350-2,000) were recorded in the Orbitrap and after
HCD activation (stepped collision energy 30 ± 3) of the five most intense
MS peaks, MS/MS scans (R = 15,000) were acquired.

For data analysis, Xcalibur raw files were converted into the MGF 
format through MSConvert (Proteowizard) and used directly as input
files for Merox. Searches were performed against an ad hoc protein
database containing the sequences of the complexes and a set of ran- 
domized decoy sequences generated by the software. The following 
parameters were set for the searches: maximum number of missed 
cleavages 3; targeted residues K, S, Y and T; minimum peptide length 
5 amino acids; variable modifications: carbamidomethyl-Cys (mass 
shift 57.02146 Da), Met-oxidation (mass shift 15.99491 Da); BuUrBu- 
modification fragments: 85.05276 Da and 111.03203 Da (precision: 5 
ppm MS1 and 10 ppm MS2); false-discovery rate cut-off: 5%. Finally, each
fragmentation spectrum was manually inspected and validated. 
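The precision settings above are mass tolerances expressed in parts per million. As a minimal illustration of how such a tolerance is applied (this is the general definition of a ppm error, not Merox code, and the function names are hypothetical), a match is accepted when the relative deviation between observed and theoretical m/z is within the limit:

```python
def ppm_error(observed_mz: float, theoretical_mz: float) -> float:
    """Signed mass error in parts per million."""
    if theoretical_mz <= 0:
        raise ValueError("theoretical_mz must be positive")
    return (observed_mz - theoretical_mz) / theoretical_mz * 1e6


def within_tolerance(observed_mz: float, theoretical_mz: float, tol_ppm: float) -> bool:
    """True if the observed m/z lies within +/- tol_ppm of the theoretical m/z."""
    return abs(ppm_error(observed_mz, theoretical_mz)) <= tol_ppm
```

At m/z 1,000, for example, a 5 ppm window spans 0.005 m/z units on either side of the theoretical value.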


Mass determination of purified EccD3 using size-exclusion 
chromatography coupled to multi-angle light scattering 

The experiments were performed at room temperature using a Super-
ose 6 Increase 10/300 GL column coupled to an ÄKTA Purifier system
(GE Healthcare) in-line with Dawn 8+ MALS and Optilab T-rEX refractive
index detectors (Wyatt Technology). Purified EccD3 (100 µl, 1 mg ml⁻¹)
was applied to a Superose-6 Increase size-exclusion column (10/300 GL)
in a buffer containing 50 mM Tris pH 8.0, 300 mM NaCl and 0.05% (w/v)
DDM. The molecular mass was determined with the ASTRA 6 software
(Wyatt Technology). For analysis of the protein conjugates, we used a
refractive index increment (dn/dc) of 0.185 ml g⁻¹ for the protein frac-
tion and 0.133 for DDM.
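In a protein conjugate analysis, the refractive index increment of the protein-detergent complex is the mass-weighted average of the increments of its two components. The Python sketch below illustrates this relation with the dn/dc values given above; it is not the ASTRA implementation, the function names are hypothetical, and the units of the DDM value are assumed to be ml g⁻¹.

```python
PROTEIN_DNDC = 0.185  # ml/g, stated in the text
DDM_DNDC = 0.133      # stated in the text for DDM; units assumed to be ml/g


def conjugate_dndc(protein_mass_fraction: float) -> float:
    """Mass-weighted dn/dc of a protein-detergent complex."""
    if not 0.0 <= protein_mass_fraction <= 1.0:
        raise ValueError("protein_mass_fraction must lie between 0 and 1")
    return (
        protein_mass_fraction * PROTEIN_DNDC
        + (1.0 - protein_mass_fraction) * DDM_DNDC
    )


def protein_mass_kda(complex_mass_kda: float, protein_mass_fraction: float) -> float:
    """Protein contribution to the total mass of the complex."""
    return complex_mass_kda * protein_mass_fraction
```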


Stoichiometric analyses of ESX-3 protein complexes by the iBAQ 
method and Oriole staining

For in-gel digestion, the excised gel slices were destained with 30% ace- 
tonitrile, shrunk with 100% acetonitrile, and dried in a vacuum con- 
centrator. Trypsin digestion was performed overnight at 37 °C in 0.05 
M NH₄HCO₃ (pH 8), using 0.1 µg of protease per slice. Peptides were
extracted from the gel slices with 5% formic acid. 

NanoLC-MS/MS analyses were performed on an Orbitrap Velos Pro 
(Thermo Scientific) equipped with a PicoView Ion Source (New Objec-
tive) and coupled to an EASY-nLC 1000 (Thermo Scientific). Peptides
were loaded on capillary columns (PicoFrit, 30 cm × 150 µm i.d., New
Objective) packed with ReproSil-Pur 120 C18-AQ 1.9 µm (Dr. Maisch),
and separated with a 30 min linear gradient from 3% to 30% acetonitrile
and 0.1% formic acid at a flow rate of 500 nl min⁻¹.

MS scans were acquired in the Orbitrap analyser with a resolution of 
30,000 at m/z 400; MS/MS scans were acquired in the Orbitrap analyser
with a resolution of 7,500 at m/z 400 using higher-energy collisional
dissociation fragmentation with 30% normalized collision energy. A
TOP5 data-dependent MS/MS method was used; dynamic exclusion
was applied with a repeat count of 1 and an exclusion duration of 30 s;
singly charged precursors were excluded from selection. The minimum
signal threshold for precursor selection was set to 50,000. Predictive
automatic gain control was used with a target value of 1e6 for MS scans
and 5e4 for MS/MS scans. The lock mass option was applied for internal
calibration in all runs using background ions from protonated deca-
methylcyclopentasiloxane (m/z 371.10124).

For processing of the raw data files, database searches and quantifica- 
tion, MaxQuant version 1.5.7.4 was used. The search was performed 
against the M. smegmatis reference proteome databases (UniProt) 
and, additionally, a database containing common contaminants. The 
search was performed with tryptic cleavage specificity with three
allowed miscleavages. Protein identification was under control of the
false-discovery rate (<1% on protein and peptide level). In addition to
MaxQuant default settings, the search was performed allowing the
following variable modifications: protein N-terminal acetylation, Gln
to pyro-Glu formation (N-terminal Gln), and oxidation (Met). Carbami-
domethylation (C) was set as fixed modification. iBAQ intensities were
used for protein quantitation. Proteins with fewer than two identified
razor/unique peptides were dismissed. 
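An iBAQ value is the summed intensity of all identified peptides of a protein divided by the number of its theoretically observable peptides, so ratios of iBAQ values approximate molar ratios. MaxQuant computes these values itself; the Python sketch below only illustrates the definition and the normalization to a reference subunit, using hypothetical function names and illustrative numbers rather than data from this study.

```python
from typing import Dict


def ibaq(summed_intensity: float, n_theoretical_peptides: int) -> float:
    """iBAQ value: summed peptide intensity per theoretically observable peptide."""
    if n_theoretical_peptides <= 0:
        raise ValueError("n_theoretical_peptides must be positive")
    return summed_intensity / n_theoretical_peptides


def relative_stoichiometry(ibaq_values: Dict[str, float], reference: str) -> Dict[str, float]:
    """Express each iBAQ value relative to that of a reference subunit."""
    ref = ibaq_values[reference]
    if ref <= 0:
        raise ValueError("reference iBAQ value must be positive")
    return {name: value / ref for name, value in ibaq_values.items()}


# Illustrative numbers only (not measured values).
example = relative_stoichiometry(
    {"subunit_a": ibaq(4.0e8, 20), "subunit_b": ibaq(1.2e9, 30)},
    reference="subunit_a",
)
```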

The purified ESX-3 core complex was separated by SDS-PAGE. Gels 
were stained with Oriole (Bio-Rad) according to the manufacturer’s
instructions. Protein band intensity was measured with the Gel Doc
XR+ (Bio-Rad) using the UV transilluminator. ImageJ was used for
quantification.
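As described in the legend of Extended Data Fig. 7e, subunit ratios were estimated from integrated band intensities while taking the molecular masses of the proteins into account. A minimal Python sketch of such a conversion is given below. It assumes that stain intensity is proportional to protein mass, which is an assumption on our part, and it uses hypothetical names and placeholder values rather than the measured intensities or the real subunit masses.

```python
from typing import Dict


def molar_ratios(
    intensities: Dict[str, float],
    masses_kda: Dict[str, float],
    reference: str,
) -> Dict[str, float]:
    """Convert band intensities into molar ratios relative to a reference band.

    Assumes stain intensity scales with protein mass, so dividing by the
    molecular mass gives a quantity proportional to the number of molecules.
    """
    molar = {name: intensities[name] / masses_kda[name] for name in intensities}
    ref = molar[reference]
    if ref <= 0:
        raise ValueError("reference band must have a positive intensity")
    return {name: value / ref for name, value in molar.items()}


# Placeholder values: two bands of equal intensity, one protein half the mass.
ratios = molar_ratios(
    {"band_a": 100.0, "band_b": 100.0},
    {"band_a": 50.0, "band_b": 25.0},
    reference="band_a",
)
```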


Crosslinking of ESX-3 

Crosslinking experiments were carried out using the same protocol 
as for the ESX-5 core complex. In brief, mycobacterial membranes
were isolated as described in the purification section, resuspended in
PBS containing 250 mM sucrose, and incubated with 1 mM disuccin-
imidyl suberate (DSS) or dimethylsulfoxide (as a control) for 30 min on
ice. The crosslinking reaction was quenched by the addition of 100 mM
glycine, 10 mM Na₂HPO₄ (pH 8.5) for 30 min. The crosslinked proteins
were extracted from the membrane by detergent solubilization with 0.5%
DDM for 1 h on ice. Aggregates and insoluble material were removed by
ultracentrifugation (100,000g, 1 h). The solubilized membrane proteins
were separated by BN-PAGE using NativePAGE Novex 3-12% BisTris gels
followed by western blot analysis using antibodies against the Strep-tag
II (StrepMAB-Classic HRP conjugate, IBA Lifesciences) according to the
manufacturer’s protocol (Invitrogen). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
Cryo-EM maps have been deposited in the Electron Microscopy Database
under accession codes EMD-10186 (core complex dimer), EMD-10187 
(protomer 1), EMD-10189 (conformation 1), EMD-10190 (conformation 
2) and EMD-10188 (dimer of EccB3). The model of the ESX-3 core com- 
Bank under PDB accession codes 6SGW, 6SGX, and 6SGY, respectively.
The cryo-EM map and model for protomer 2 (discussed but not shown,
because it has lower resolution than that of protomer 1) are deposited 
under accession codes EMD-10191 and 6SGZ. All gel source data are 
provided in Supplementary Fig. 1, and all other data are available from 
the corresponding author upon reasonable request. 


Extended Data Fig. 1 | Purification and cryo-EM of ESX-3 protein complexes.
a, Overview of the constructs tested for co-expression studies. The illustrated 
gene cassettes were cloned into plasmid pMyNT and expressed under the 
control of the acetamidase promoter (Ace). Ace RBS, acetamidase ribosome-
binding site. From top to bottom: the ESX-3 gene cluster (pMyNT:ESX-3), the
ESX-3 gene cluster without substrate genes (pMyNT:ESX-3ΔS), without genes
encoding substrates and cytosolic proteins (pMyNT:ESX-3ΔSC), only the ESX-3
membrane proteins (pMyNT:Mini), the membrane proteins EccB3, EccC3,
EccD3, EccE3 but not MycP3 (pMyNT:MiniΔMycP3). b, Western blot of TeDM-
solubilized membrane extracts containing ESX-3(EccC3-His6) complexes expressed in
M. smegmatis mc²155 from constructs listed in a and separated by BN-PAGE.
Anti-His6-tag antibodies were used to detect the His-tagged ESX-3 complexes.
The results represent three independent experiments. c, The ESX-3 core
complex was expressed in wild-type M. smegmatis mc²155 from plasmid
pMyNT:ESX-3 encoding the entire ESX-3 gene cluster and purified; analysis by
BN-PAGE (left) and SDS-PAGE (right) is shown. The results represent three 
independent experiments. d, Size-exclusion chromatogram (Superose-6 
Increase 10/300 GL) after the TeDM-to-Amphipol A8-35 exchange of the ESX-3 
core complex. The positions of peak 1 (P1) and peak 2 (P2) are indicated with 
arrows. The results represent three independent experiments. e, Overview 
micrograph of the higher oligomeric ESX-3 core complex (>1.2 MDa; P1). Some 
particles are highlighted by circles. Scale bar, 50 nm. f, Representative 2D class 
average of the larger ESX-3 species. The membrane region is indicated with a
bracket. g, Overview micrograph of the 900-kDa ESX-3 core complex (P2). Some
particles are highlighted by circles. Scale bar, 50 nm. h, Several 2D averages of 
the ESX-3 core complex after focused centring on the flexible fork. A flexible 
attachment of the fork to the membrane region can be observed. 
Extended Data Fig. 2 | Image processing and classification strategy for the dataset was collected for high-resolution analysis. After removing bad particles,
(EccB3). This analysis suggests that EccB3 is involved in dimerization. b, A larger
the density using Phenix Refine. c, Plot revealing that density for the two EccC3
TMHs is well defined only at one end of the helix. Notably, the density for one of
the EccC3 TMHs is better defined in the structure of the subtracted monomer,
and this is represented in the panels showing the monomer structure in Figs. 2, 3.
Extended Data Fig. 7 | Validation experiments. Experiments to validate that
the ESX-3 core complex extracted from actively secreting mycobacteria is the
same as the complex for which the structure was solved. We investigated the
expression of the ESX-3 complex in the mycobacterial membrane, the
stoichiometry of the extracted and purified complex and the orientation of the
complex in the membrane. a, Overview of the constructs used in secretion
assays and for the validation of the structure of the ESX-3 core complex. Plasmid 
pMyNT:ESX-3 encodes the ESX-3 gene cluster under acetamidase promoter 
control, and pMyNT:ESX-3i under control of the native IdeR promoter. The
expression of the latter construct is induced in iron-depleted culture medium.
b, BN-PAGE and western blot analyses (EccC3-His) indicate the presence of the
900-kDa ESX-3 core complex in the membrane of secretion-competent cells that
contain the ESX-3 gene cluster under control of either the acetamidase promoter 
or the native IdeR promoter (see a). The gel represents two independent 
experiments. c, Comparison of size-exclusion chromatography profiles of ESX-3 
core complexes purified from secretion-competent cells transformed with 
plasmids pMyNT:ESX-3 and pMyNT:ESX-3i (both expressed in M. smegmatis 
ΔESX-3; see a), with the ESX-3 core complex purified from the minimal
expression construct (pMyNT:Mini) expressed in wild-type M. smegmatis, which
was used to determine the cryo-EM structure (Extended Data Fig. 1a). The
chromatograms are representative of three independent purifications. d, ESX-3
core complexes purified from secretion-competent M. smegmatis ΔESX-3
transformed with either pMyNT:ESX-3 or pMyNT:ESX-3i, in the presence and
absence of β-mercaptoethanol. A disulfide bridge is found in the EccB3 protein
in complexes purified from secretion-competent cells. The two positions for 
EccB3 in SDS-PAGE are indicated. In the structure of the ESX-3 core complex, 
EccB3 is periplasmic and contributes to the assembly between protomers. We 
observe a different mobility for EccB3 in the ESX-3 core complex purified from 
secretion-competent cells when the SDS-PAGE is performed in the presence or 
absence of β-mercaptoethanol. This is compatible with an intramolecular
disulfide bridge in EccB3, as expected in the oxidative environment of the 
periplasm. The gel represents two independent experiments. e, Determination 
of the stoichiometry of ESX-3 complexes in secretion-competent M. smegmatis 
ΔESX-3 transformed with either pMyNT:ESX-3 or pMyNT:ESX-3i. Oriole staining
of the SDS-PAGE bands—in order to estimate the ratio of subunits obtained— 
showed that EccD3 is over-represented, in agreement with the observed 1:1:2:1 
stoichiometry in the structure of the ESX-3 core complex. SDS-PAGE shows that 
the density of the bands of EccD3 is higher than that of the EccE3 bands, despite both
proteins having a similar molecular mass. The intensities of the bands were
integrated and the relative ratio of subunits was estimated, taking into account 
the differences in molecular mass of the proteins. In the bar chart, the bars are 
the reported means and the standard deviations are shown as dots. The results 
indicated a stoichiometry of 1:1:2:1 (EccB3:EccC3:EccD3:EccE3). The gel shows 
three technical replicates and is representative of two independent 
experiments. f, Analysis of the composition of the protein complex subunit 
using iBAQ. The size of the dots correlates with the number of identified unique 
peptides. The iBAQ-MS method was, in our experiments, not sufficiently 
sensitive to define the stoichiometry of ESX-3. We used the ESX-3 complex from 
our structural studies as an internal control because the stoichiometry of the 
ESX-3 structure was fully determined as 1:1:2:1 at the high resolution of our cryo- 
EM maps. The iBAQ method estimates a 1:1:1:1 stoichiometry for the purified 
complex that we use to resolve the structure by cryo-EM, thus indicating that the 
method does not report the correct stoichiometry for ESX-3 in these 
experiments. When the stoichiometry of the native ESX-3 core complex in 
secretion-competent cells was estimated by the iBAQ method used previously 
for the ESX-5 core complex, a 1:1:1:1 ratio was also obtained (data not shown).
g, Analysis of purified EccD3 using size-exclusion chromatography (Superose-6 
Increase 10/300 GL column) coupled with multi-angle light scattering. The lines
show the deconvolution of the contributions of the EccD3-DDM protein- 
detergent complex (dot-dashed line), the protein (dashed line) and the detergent 
micelles (dotted line) to the total mass of the complex. This experiment 
demonstrates that EccD3 dimerizes as observed in the structure of the ESX-3 core
complex. The experiment is representative of two technical replicates.

h, Analysis of the derivatives of the ESX-3 core complex by size-exclusion 
chromatography. Removing the periplasmic fork of EccB3 or EccE3 leads to 
dissociation towards single protomers, showing that both components are 
essential for the stability of the ESX-3 complex. The elution peak of the ESX-3 
core complex dimer is indicated. The chromatograms are representative of two 
independent purifications. i, Isolated mycobacterial membranes from wild-type 
M. smegmatis containing the ‘high yield’ minimal expression construct 
(pMyNT:Mini) and extraction of ESX-3 core complexes after crosslinking using 
DSS. ESX-3 complexes were analysed by BN-PAGE and western blot analyses 
(EccD3-StrepII). As a control, complexes formed by EccD3 expressed alone were
also analysed simultaneously. Only ESX-3 assembled into large complexes, 
including those of higher molecular mass (higher MW species) than the ESX-3 
dimer. The gel is representative of two independent experiments. 
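
The mass correction described for panel e can be sketched in a few lines. This is a minimal illustration, not the authors' analysis: it assumes that stain intensity scales with the mass of protein in a band, and the function name, band intensities and molecular masses below are hypothetical placeholders chosen only to show the arithmetic.

```python
def relative_stoichiometry(intensities, masses_kda, reference):
    """Convert integrated band intensities into molar ratios.

    Assumes (hypothetically) that stain intensity is proportional to the
    mass of protein in the band, so intensity / molecular mass is
    proportional to the number of molecules.
    """
    molar = {name: intensities[name] / masses_kda[name] for name in intensities}
    ref = molar[reference]
    # Express every subunit relative to the chosen reference subunit.
    return {name: round(value / ref, 2) for name, value in molar.items()}


# Placeholder values, NOT measurements from the paper.
example_masses_kda = {"EccB3": 55.0, "EccC3": 145.0, "EccD3": 50.0, "EccE3": 50.0}
example_intensities = {"EccB3": 55.0, "EccC3": 145.0, "EccD3": 100.0, "EccE3": 50.0}

ratios = relative_stoichiometry(example_intensities, example_masses_kda, "EccB3")
print(ratios)
```

With these placeholder inputs, EccD3 and EccE3 have the same mass but EccD3 has twice the band intensity, so the sketch reports twice as many copies of EccD3, which is the reasoning the legend applies to the Oriole-stained gel.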
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Extended Data Fig. 8 | Building the ESX-3 hexamer using the ESX-5 negative-stain
structure as a template. a, Modelling of the ESX-3 secretion machine. The
negative-stain map of the hexameric ESX-5 core complex (EMDB-3596) was used
to generate a model for the ESX-3 secretion machine. For this, the rigid core of
the ESX-3 dimer (without the flexible arm and fork, shown in red) was fitted into
ESX-5 (shown as white transparent density) using the contour information of the
negative-stain structure. Subsequently, another two dimers were fitted in the
remaining subunits to form a hexamer. The atomic structure of each ESX-3 dimer
fits approximately the dimensions of ESX-5, as shown in the top and side views of
ESX-5. Because the EMDB-3596 reconstruction was generated assuming a
perfect six-fold symmetry, all subunits in the ESX-5 complex are identical,
whereas the ESX-3 hexamer is based on a trimer of dimers.

[Figure panel labels: 2 dimers fitted in ESX-5; 3 dimers fitted in ESX-5; side views; periplasmic side; cytosolic side]
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Extended Data Fig. 9 | The structure of the EccC3/EccB3 contact is
reminiscent of ABC type F transporters. a, Side-by-side comparison between
the structure of the DUF and stalk domains of EccC3 interacting with EccB3 in
the structure of the ESX-3 core complex, and the MacB ABC transporter (PDB ID:
5NIL). The open (ATP unbound) state of MacB is shown. ATP binding and
hydrolysis trigger long-range conformational changes in the stalk domains and
the periplasmic domains of the MacB dimer, which close and open the
transmembrane channel as well as the periplasmic exit.
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Extended Data Fig. 10 | Comparison between the negative-stain structure of
ESX-5 and the cryo-EM structure of ESX-3. a, Side and top views of the ESX-5
structure, which has been solved previously (EMDB-3596). The structure was
obtained after applying six-fold symmetry. The use of staining agents limits the
obtainable information to the external shape of the complex, while secondary
structure elements and internal details are not resolved in negative-stain
structures. EccC5 and EccB5 are not visualized in the structure, possibly owing
to their flexibility and the use of six-fold symmetry. b, Side and top views of the
ESX-3 model (this work).
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


x| The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


x A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 




x A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 
A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient)
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals)
For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted
Give P values as exact values whenever suitable.
x For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 
x For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 
x Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Cryo-EM: FEI EPU 1.7.0; Western Blots: Fujifilm Imager software v3.0. 


Data analysis Cryo-EM data were processed and analyzed using RELION v2.1 and 3.0, cryoSPARC v2, cisTEM, SCIPION, LocalDeblur, MotionCor2, Gctf 
and Gautomatch, Chimera v1.14, Coot v0.8.9.2. COOT was used for model building. 


Mass spectrometry data were processed with MSConvert (ProteoWizard) and MeroX software.
SEC-MALS data were analyzed with ASTRA 6 software (Wyatt Technology) and GraphPad Prism v6.0h.


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information 


Data 


Policy information about availability of data 


All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability


Cryo-EM maps corresponding to the central core region dimer, protomer 1, protomer 2, cytoplasmic distal region, and the fork have been deposited in the Electron 
Microscopy Database under accession codes EMD-10186, EMD-10187, EMD-10191, EMD-10189, EMD-10190 and EMD-10188 respectively and the model of the 
ESX-3 core complex, protomer 1, protomer 2 and the fork under accession code PDB ID 6SGW, PDB ID 6SGX, PDB ID 6SGZ, PDB ID 6SGY, respectively. 
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Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


[x] Life sciences    [ ] Behavioural & social sciences    [ ] Ecological, evolutionary & environmental sciences


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size All experiments were replicated two or more times. Three protein samples were independently purified
for cryo-EM data collection and structure determination of the ESX-3 core complex. For structure determination, the sample size was
determined by the selected number of particles which yielded the cryo-EM map with highest resolution.
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Data exclusions No data were excluded from the analyses. 


Replication All experiments were replicated successfully. We have provided a detailed documentation of the performed experiments as well as the
procedure to derive the cryo-EM maps and the structural models. All reagents used in this study are commercially available. The cryo-EM data 
processing and structure modelling software used in this study is freely available. 


Randomization Samples were not allocated to groups. 


Blinding During data collection and analysis, investigators were not blinded because this procedure is not commonly applied in cryo-EM. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems
n/a | Involved in the study
[x] Antibodies
[x][ ] Eukaryotic cell lines
[x][ ] Palaeontology
[x][ ] Animals and other organisms
[x] Human research participants
[x][ ] Clinical data

Methods
n/a | Involved in the study
[x] ChIP-seq
[x][ ] Flow cytometry
[x][ ] MRI-based neuroimaging


Antibodies 


Antibodies used Monoclonal Anti-polyhistidine mouse antibody peroxidase conjugate (clone HIS-1; Merck, catalog# A7058, lot #027M4768V; 
1:5000). 
Monoclonal Anti-RNA polymerase mouse (RNAP) antibody (clone 8RB13; BioLegend, catalog# 663903, lot# B219269; 1:2000).
Monoclonal Anti Strep-tag II peroxidase conjugate (iba life-sciences, catalog# 2-1509-001, dilution: 1:6.000) 
Mouse IgG-HRP (GE Healthcare, catalog# NXA931, dilution: 1:10.000) 
Anti Rabbit-HRP (Carl Roth, catalog# 4750.1, lot# 4326, dilution: 1:10.000) 


Polyclonal anti-(EsxG/ EsxH) rabbit serum was generated in collaboration with immunoGlobe GmbH (Himmelstadt, Germany) as 
described in the method section. 


Validation Specificity of polyclonal anti-(EsxG/ EsxH) rabbit serum was validated by comparison of culture media of wild type 
Mycobacterium smegmatis to a M. smegmatis ΔESX-3 knockout strain that lacks genes esxG and esxH as well as by comparison
to the purified antigens as shown in Figure 1a.




Monoclonal anti-polyhistidine antibody peroxidase conjugate was validated by comparison to an M. smegmatis strain (carrying 
an empty pMyNT vector) that does not express His-tagged proteins (Extended Data Figure 1d). 


The specificity of the Monoclonal anti-RNAP was validated in Bergendahl V, et al. 2003 (Protein Expr. Purif. 31:155.). 
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with both ‘Champalimaud Research, Champalimaud Centre for the 
Unknown, Lisbon, Portugal’ (affiliation 1) and ‘Instituto de Medicina 
Molecular Joao Lobo Antunes, Faculdade de Medicina, Universidade 
de Lisboa, Lisbon, Portugal’ (originally affiliation 4, now renumbered 
as affiliation 2), instead of only affiliation 1. Affiliations 2-4 have been 
renumbered accordingly. The original Letter has been corrected online. 
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In Fig. 1 of this Article, the label for residue 24 in peptide 4 should have 
been ‘Thr’ instead of ‘Dab’, and in peptide 5, the labels for the ‘N’ and ‘C’
termini should have been reversed (with the C terminus next to Dab,
and the N terminus next to Trp). This figure has been corrected online.
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Adrian Smith’s footage of trap-jaw ants (genus Odontomachus) was a surprise hit on YouTube. 


BITES, CAMERA, ACTION...


Filming my work with ants has changed my 
science-communication tactics. By Adrian Smith 


Nine months ago, in my research
laboratory at the North Carolina
Museum of Natural Sciences, in
Raleigh, I filmed myself being bitten
by an ant. It wasn’t a bite from
an ordinary, everyday ant. It was my main 
study organism, a trap-jaw ant of the genus 
Odontomachus, with jaws that snap shut faster 
than almost any other recorded animal move- 
ment. It’s so fast that visualizing it requires 
filming at a minimum of 60,000 frames per 
second. When I show high-speed videos of 
these ants, and talk about them, inevitably 
I’m met with the question: “Would it hurt if 
they snapped against you?” That’s a question 
that was answered almost immediately for 
me when I started working with them eight 
years ago: no. They'd snap their tiny jaws at my 
hands and bounce off, nearly unnoticed, while
I scavenged through excavated nest soil in the
field or cleaned their nest boxes in the lab. 
But that question inspired me to expand 
my approach to communicating science. 
Until then, I had focused on imparting the 
ends of my scientific pursuits — the research 
results — but had overlooked opportunities 
to get across the fundamental, and often 
more exciting, aspects of my research: the 
initial experiences and observations. I realized
that I had more research stories to tell,
beyond just the final results. So, I switched on
my high-speed camera and stuck my finger in
front of a trap-jaw ant.


© 2019 Springer Nature Limited. All rights reserved.


Snapping the snap 


The four-and-a-half-minute video shows, in 
super-slow motion and in close-up, macro 
detail, a trap-jaw ant snapping against the tip 
of my finger (see go.nature.com/2daSdy). The 
force of the strike deflects off my unharmed 
finger back to the ant, sending her flying off 
the platform she was standing on. It’s a shot
that no one had filmed before, to my knowledge,
contextualizing the snap of these ants
on a human scale. The remainder of the video
is me in the lab adding a “how” and “why” 
narrative to the footage. It’s not a video pre- 
senting a scientific result: it’s a video about 
what it’s like to experience and observe these 
ants as a researcher. I uploaded it to my lab’s 
YouTube channel and, to my surprise, a pro- 
duction company making a show for the US 
television channel Animal Planet saw it and 
asked to license the story. The full video will 
be shown on the channel’s How Do Animals Do
That? programme. The licensing fee that came
to my lab funded a summer resident assistant’s
stipend for a graduate student. 

Before all this happened, I assumed that the
value of a researcher’s science-communication
output depended on published research find- 
ings — not on personal experiences. Its value 
was not monetary; rather, it was assessed by 
the impact of its results, the level of press 
exposure, or the extent to which it generated 
excitement and appreciation for science. 
Most of my output had been videos summarizing
articles that I and a few colleagues had
published in scientific journals. I’d attach 
these to an institutional press release — and if 
mass-media outlets picked up the release and 
associated media, I'd have an opportunity to 
communicate my work to a broad audience.


Tip of the iceberg 
These products worked, and I did find a lot
of value in communicating research in this 
way (I wrote a column about this process last 
year: A. A. Smith Nature 556, 397-398; 2018). 
But it was a struggle to release more than two 
or three videos a year, because they were 
dependent on research projects progressing 
through peer review and onto publication. I 
was communicating the peer-reviewed tip of 
a scientific iceberg. The rest of my work, some
of the most interesting parts, remained untold 
and hidden from view. 

At its core, my job as a research scientist 
is to try to see and interpret the world in a
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way that has not been attempted before. As 
a behavioural ecologist, I begin most of my 
research projects by trying to make and document
an original observation about the life of an
organism. But many of my observations end up 
never being documented in my peer-reviewed 
manuscripts. They might, for example, raise 
questions well beyond my area of expertise; or 
it could be (especially with observations that I 
was lucky to make) that trying to replicate them
in a scientific study is just too time-consuming
or costly. But I am starting to realize that none
of those reasons prevents me from turning 
those observations into interesting stories that 
can drive public interest in science, and that 
have value beyond the rather narrow world of 
academic behavioural ecology. 

Take my recent research experience with ant 
stingers. We know little about the mechanics of 
an ant sting. For instance, no one has ever filmed
how venom is delivered out of the stinger. So, I 
spent a week perfecting camera techniques to 
gather slow-motion video of these microscopic 
bits of ant anatomy in action. What I recorded 
were the first detailed videos of venom being 
pumped from the stinger. From this footage, I 
could assess what was and wasn’t measurable, 
and what expertise I’d need from a collaborator
to move this project forwards. Beyond that,
however, this bit of observational science was a 
fascinating story that I had to share. 


Stinger zinger 

I collated, edited and uploaded a 3.5-minute 
video showing the footage and explaining why I
thought it was original and interesting. When it
debuted, I also published a short Twitter thread
(see go.nature.com/35kjfy) summarizing the 
video and providing some excerpts. The thread 
went viral, trending on the site’s news feed and 
gathering more than two million impressions 
on the platform. That week, I spoke about the
footage on two local news broadcasts; and 
several mass-media news outlets, including 
Science, published articles embedding video 
of the stinger footage that I had just captured. 
On YouTube, the footage garnered more than 
250,000 views, and messages appeared in my 
inbox from people who had seen the footage 
and wanted to work with me. 

I’m not sure whether these ant-stinger 
observations will lead to a peer-reviewed, 
published piece of science. I hope they do, but
it wouldn’t be unusual if this project failed to
progress to that stage. What I do know is that
expanding my science-communication efforts
to pass on more than just the end products of
my scholarly work has added a lot of value to 
what I produce as a scientist. 


Adrian Smith is head of the Evolutionary 
Biology & Behavior Research lab at the North 
Carolina Museum of Natural Sciences and 

a research assistant professor in biology at 
North Carolina State University in Raleigh. 
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Mike Flannigan 
Fire researcher 


Mike Flannigan is a professor with the 
Department of Renewable Resources at 
the University of Alberta, Canada. Here he 
speaks to Nature about his work and the 
boom in wildfires worldwide. 


How did you get into wildfire research? 
I’ve always been interested in fire. When I
turned one year old, I burnt my finger on
the candle of my birthday cake because I
was fascinated by the flame. And I’ve been
tasting fire ever since. You might call me
a pyrophile.

I did an undergraduate physics 
degree at the University of Manitoba 
in Canada and, after a one-year course 
in meteorology, I worked as a weather
forecaster for the national weather
service from 1979 to 1981. I then joined
the Canadian Forest Service (CFS) as a
physical scientist. After completing a
PhD in plant sciences at the University of
Cambridge, UK, in 1993, I returned to work
at CFS for 31 years. Now I’m a fire professor
at Alberta. 


What have you learnt from your research? 
In Canada, the area burnt by wildfires 

has doubled since 1970. They previously 
burnt around one million hectares a year. 
Since 1990, we have been averaging more 
than 2.5 million hectares a year, and this is 
because of climate change. In the western 
United States, the area burnt has actually 
quadrupled since the 1970s, in large part 
because of climate change. 


A wildfire burns near Ashcroft, Canada.
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Has the increase created more 
opportunities for wildfire researchers? 

Yes. Canada, for example, is developing a 
Can$50-million (US$37.6-million) satellite 
for detecting and monitoring wildfires. As 
a research team at Alberta, we hope that we
will soon get Can$5 million in state-funded 
fire research. Things have changed in the 
field. We are actually getting funding now. 


What kinds of scientist can enter the field? 
Fire is a fairly broad topic. Meteorologists, 
physicists, atmospheric scientists, 
engineers, forestry specialists, 
geographers, computer-science 
programmers — you name it — could 

get into the field. But fire also has
an operational side. Canada spends
Can$800 million a year directly on
fire management, so there are a lot of
professional positions in fire management. 
Federal, provincial and state conservation 
agencies hire research scientists, technical 
analysts and field technicians. Forest 
services, parks and land-management 
agencies are also recruiting. If the topic of 
your thesis includes something directly
or indirectly related to fire, then you have
an opportunity to get a job.


What should the public know about fire? 
Fire is a global issue. Around the world, 
350 million to 450 million hectares of land 
— close to the size of India — burn every 
year. In many regions, we are seeing fires 
that are more difficult to extinguish, and we 
will probably continue to experience more 
catastrophic events and, unfortunately, 
more loss of life. As an international 
community, we have to learn to live with 
fire and to address it globally. Because 
people start some 90% of wildfires, we 
need to be careful and observant. If you 
see a wildfire, report it right away to the 
authorities. We should also consider 
volunteerism. Programmes such as 
Firewise USA and FireSmart in Canada help 
homeowners and communities to protect 
themselves against fire through education, 
guidelines and more. Such organizations 
are always looking for volunteers. 


Interview by Stav Dimitropoulos 
This interview has been edited for length 
and clarity. 
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Where I work
Mona Nemer 


Photographed for Nature 
by Rémi Thériault. 
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Walking re-energizes me when I
have difficult decisions to make.
I have a lot of them: as chief
science adviser to the Canadian 
government, my role involves 
convening experts from multiple disciplines 
to offer guidance on the complex issues 
facing society, from renewable energies to 
the hazards of microplastics. 

Near our office, there is a path along the 
Ottawa River that goes by the Parliament 
and Supreme Court buildings and makes 
a circuit, with Ottawa on one side and
Gatineau, Quebec, on the other. Its natural 
beauty helps me to reflect on complex issues 
and on unexpected scientific findings. 

In May 2018, the Minister of Fisheries, 
Oceans and the Canadian Coast Guard 
asked me to convene an advisory panel 
on how to use science better in decisions
regarding aquaculture sites and regulations.
I assembled a team, but there were collisions
at the start because participants embraced
opposing views on the value and safety
of fish farming. After two days of intense
discussions, I wasn’t sure we could reach
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a consensus. I took a walk by the water 
and asked myself: “What is it we all have in 
common and that we all value?” I realized 
that everybody values the environment and 
wants what’s best for Canadians. 

During my walk, I developed a strategy 
to establish our common ground by 
asking team members to identify what 
was most valuable to them. We eventually 
recommended, in part, that the government 
use best scientific practices, including local 
and Indigenous knowledge, adopt an open- 
science approach and prioritize regional 
ecosystem differences in making decisions. 

The plants, flowers, trees and shrubs 
along the path, and the changing colours 
of the leaves, are chemistry and biology in 
action. You also come to the locks of the 
historic Rideau Canal. This confluence of 
science and technology reminds me how 
far we’ve come — and of the importance of 
science for providing solutions for society. 


Mona Nemer is a molecular biologist and 
Chief Science Advisor of Canada. Interview 
by Kendall Powell. 
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As scientists race to find low-carbon technologies to reduce
greenhouse-gas emissions, energy is a hot research topic in
materials science. It is obvious, then, that the search for better
batteries is a focus for some emerging stars in materials science
(see page S26). We also explore the field of energy harvesting,
to find that everything old is new again: the cellulose fibres in wood,
cotton and paper turn out to have a cellular structure that can be treated
to harvest energy from low-heat sources such as the body, opening the
possibility that ‘old’ materials might power the smartphones and other
small devices of the future (see page S38).

It’s one thing to invent, discover or repurpose a material in a lab and 
publish it in a scientific journal, but transforming it into a commercially
successful product, or the basis of an entirely new industry, is a huge
accomplishment. The journey can take decades, be fraught with pitfalls
and dependent on patient, deep-pocketed commercial partners. Yet the
rewards can be immense (see page S20). Although we look to materials
science innovations in the fight to solve climate change, they are, at
best, one shaft of light in what must be a rainbow spectrum of solutions.

As government-backed initiatives in the United States and China compete
to resurface ‘lost’ research, the next big discoveries could be made
by machine-learning algorithms and data mining. China has emerged as
a formidable force, although a spread of countries is still evident among
the top-performing institutions (see page S42). No fewer than 43 of the 50
institutions that clocked the biggest rises in materials science research
output in the Nature Index journals from 2015 to 2018 are from China (see
page S34). In 2018, the annual growth in China’s materials science Share
in the Nature Index was 15.8%; the US’s Share in the same year fell by 10.3%.

Readers will note that this supplement introduces this change in our 
metrics terminology: our signature metric, Fractional Count (FC), is now
called Share. It is a measure of an institution’s contribution to articles 
in the 82 journals tracked by the index, based on the proportion of its 
affiliated authors on an article’s author list, with each author deemed 
to have contributed equally. The new term for our former Article Count 
(AC), which gives an institution, or country/region a score of 1 for every 
article on which one or more of its affiliated authors appear, is simply 
Count. You can find out more at go.nature.com/indexcount 
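
The two metrics can be illustrated with a short sketch. It follows only the wording above and is not the Nature Index's own code: it assumes, for simplicity, that every author lists exactly one affiliation (the text here does not say how multiple affiliations are handled), and the function names and institution labels are hypothetical.

```python
from collections import Counter


def article_share(author_institutions):
    """Share per institution for one article.

    Each author is deemed to have contributed equally, so an institution's
    Share is the fraction of the author list affiliated with it.
    Assumes one affiliation per author (a simplification for this sketch).
    """
    n_authors = len(author_institutions)
    per_institution = Counter(author_institutions)
    return {inst: k / n_authors for inst, k in per_institution.items()}


def article_count(author_institutions):
    """Count per institution for one article: 1 if any author is affiliated."""
    return {inst: 1 for inst in set(author_institutions)}


# Hypothetical article with four authors from three institutions.
authors = ["Institute A", "Institute A", "Institute B", "Institute C"]
share = article_share(authors)
count = article_count(authors)
print(share)
print(count)
```

In this example Institute A receives a Share of 0.5 and the other two 0.25 each, while all three receive a Count of 1; summing these per-article values over all tracked articles would give an institution's totals.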


Catherine Armitage 
Chief editor 
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A matter of
time and risk


In a material world, scientific discoveries can revolutionize
industries, but the road is long. By Neil Savage


Renewable energy conversion rates are improving through breakthroughs in technologies such as organic solar cells. 


In the 1960s, the smart advice to a young
person in search of a lucrative career was 
‘plastics’, as canonized in the 1967 film, 
The Graduate. Today, the advice might be 
‘OLEDs’ or ‘nanomaterials’. In another 20 
years, it might be ‘graphene’ or ‘topological 
materials’. 

The hunt for new, commercially viable mate- 
rials with highly sought-after properties is one 
of the most competitive pursuits in scientific 
research. It offers clear applications, strong 
paths to market and is among the most inter- 
disciplinary of the physical-sciences research 
fields. Even so, relatively few discoveries, such 
as organic light-emitting diodes (OLEDs) and, 
to a lesser extent, graphene, rise above the 
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throng of nascent materials; many languish 
in the scientific literature. 

The stakes are high: a new material in the 
right place and at the right time can launch a
multibillion-dollar industry. Consider what 
happened for Charles Goodyear. When he 
mixed sulfur with natural rubber and invented 
vulcanized rubber in 1839, his discovery gave 
rise to a multitude of new rubber products with
unprecedented tensile strength and elasticity. 
It advanced the automobile industry, freeing 
motorists from unwieldy iron wheels. With the 
smoother ride offered by rubber tyres came 
the impetus to create the first gas-powered 
cars. More recently, global industries produc- 
ing smartphones, wearable medical devices, 
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renewable energy and new aeroplanes have 
emerged from materials science research. 
Artificial intelligence might prove to be the 
Charles Goodyear of our era. Machine-learning 
techniques are being used to comb through 
published research to unearth materials already 
discovered but forgotten, and to predict the 
properties of new materials that don’t yet exist. 
This replaces the trial-and-error approaches 
the field has relied on. According to James War- 
ren, a physicist at the US National Institute of 
Standards and Technology in Gaithersburg, 
Maryland, in a very dynamic field it’s one of the
most exciting developments in recent years. 
Warren is director of the Materials Genome 
Initiative (MGI), a multi-agency project set up 




by the US government in 2011. The database 
of materials and their properties is open and 
available to researchers, to help move those 
materials from journal pages into potential 
products, says Warren. “The whole point of 
MGI is literally to bridge that gap,” he says.

In just a few years, he says, it may be possi- 
ble to tell a computer that you want to build 
a battery with a certain storage capacity, life- 
time and cost, and have it suggest the optimal 
combination of materials to make it happen. 


A long road


It can take decades for a new material to 
have an impact. The worldwide market for 
OLEDs reached US$19.45 billion in 2017, and 
is expected to grow to $81.76 billion by 2026. 
But the OLEDs in your TV are a world away
from the first experiments in the late 1950s to 
generate light from organic materials. 
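The two market figures quoted above imply a compound annual growth rate, which the following minimal sketch works out. The function name and the assumption of smooth year-on-year compounding are illustrative only; the article gives just the 2017 and 2026 values, and the result comes to roughly 17% per year.

```python
def implied_cagr(start_value: float, end_value: float, years: int) -> float:
    """Compound annual growth rate implied by two market-size figures."""
    return (end_value / start_value) ** (1 / years) - 1


# Figures quoted in the text, in US$ billions.
oled_2017 = 19.45
oled_2026 = 81.76

# Assumption: growth compounds evenly over the nine years between the figures.
rate = implied_cagr(oled_2017, oled_2026, years=2026 - 2017)
print(f"Implied growth rate: {rate:.1%} per year")
```

The calculation says nothing about how the forecast was produced; it only shows what the two published numbers imply if growth were steady.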
Stephen Forrest, a physicist at the University 
of Michigan, Ann Arbor, who invented phosphorescent
OLEDs, is well aware of that. In 1994,
New Jersey-based company, Universal Display
Corporation, was founded to develop displays 
based on OLEDs. The company is now valued on 
the NASDAQ at more than US$9 billion, and has 
more than 5,000 issued and pending patents 
worldwide, with offices in the United States, 
China, South Korea, Japan and the United 
Kingdom. “The technologies that we had [at 
launch] were not particularly compelling to the 
market,” Forrest says. “Things happened both
within the marketplace and within our lab and 
it sort of exploded, and now we have a $20-billion
global display market based on OLEDs,
and every one of them use our materials.” 
Forrest has helped establish several com- 
panies based on his research, covering areas 
such as organic solar cells, which are lighter and
more flexible than their silicon counterparts, 
and indium gallium arsenide detectors for infrared
imaging. But it’s not just a matter of creating
a new type of material and setting up shop.
“You've got to be solving a real problem that 
people have an interest in,” he says. “You can’t 
invent markets through work in the laboratory.”
Companies often say it takes at least 10 years 
for a material to move from the laboratory to 
the market, says Zhenan Bao, a chemical engi- 
neer at Stanford University in California who 
works with nanomaterials and flexible elec- 
tronics, including organic semiconductors 
and carbon-based circuits. In 2010, Bao’s lab 
spun out C3 Nano, a company that makes a
flexible, transparent electrode based on sil- 
ver nanowires. It’s being used in some fold- 
able displays and mobile phones available in 
China, she says, and may soon spread else- 
where. Since she was at Bell Labs (now owned 
by Nokia) 20 years ago, she has been working 


on foldable displays that could fit more phone 
screen in a given space.

Bao’s focus has broadened to developing 
wearable sensors that adhere to the skin and 
track pulse and other health indicators (see 
‘Wearing well’, page S22), as well as electronic 
skin, a flexible, transparent material made of 
a self-healing silicone polymer with a conductive
layer. It can sense pressure and send an
electrical signal to the brain through an elec- 
tronic artificial nerve. 

Other materials researchers are develop- 
ing electronic skin, including Benjamin Tee 
at the National University of Singapore and 
Shenqiang Ren at the State University of New 
York at Buffalo. 

Graphene could be an important compo- 
nent of the necessary flexible electronics for 


Bao’s electronic skins. Its unique properties 
have excited scientists since the material was 
first isolated in 2004. A one-atom-thick layer 
of carbon, graphene is stronger than diamond,
has the highest electron mobility of any known 
material, and is more than 97% transparent to 
wavelengths of light from the ultraviolet to the
far infrared. Those properties could come in 
handy, but so far, no one has come up with an
application that has had much of an impact in
the marketplace. Grand View Research esti- 
mates the global market for graphene was only 
$43 million in 2017, mainly in applications such 
as protective coatings for flexible electronics. 

The discovery of graphene has, however, 
opened up research into other 2D materials 
— atomically thin layers of an element or com- 
pound. These may be essential to overcoming 
the looming end of Moore’s law, when the tran- 
sistors that power computers physically can’t 
get any smaller. Forrest says material that can 
move electrons more rapidly than silicon will be
necessary to keep increasing computing power. 

Quantum computing, a fledgling technology
based on the laws of quantum mechanics,
might benefit from an emerging class called
topological materials, whose electronic structures
impart unusual properties. Topological
insulators, for instance, conduct electricity on
their surface while blocking it from flowing
through their bulk.


TOP 5 CORPORATE-ACADEMIC COLLABORATIONS*

Rank | Institution 1 | Institution 2 | Countries | Bilateral CS | Count
1 | Samsung Group | Sungkyunkwan University (SKKU) | South Korea | 25.89 | 32
2 | Center for Nanotechnology GmbH | University of Munster (WWU) | Germany | 20.99 | 4
3 | Samsung Group | Stanford University | South Korea/US | 13.92 | 7
4 | Samsung Group | Korea Advanced Institute of Science and Technology (KAIST) | South Korea | 11.91 | 7
5 | IBM Corporation | Swiss Federal Institute of Technology Zurich (ETH Zurich) | Switzerland/US | 11.12 | 16

Science (%)
1 | IBM Corporation | United States | 112.92 | 223 | 59.0%
3 | DowDuPont | United States | 28.34 | 56 | 56.1%
4 | Nippon Telegraph and Telephone Corporation | Japan | 20.41 | 34 | 34.0%
5 | Toyota Group | Japan | 18.38 | 47 | 45.0%

*Date range for data is 2015-18


An adhesive, flexible wearable sensor measures heart and breathing rates.

Google recently demonstrated Sycamore, 
a computer with 53 quantum bits, which per- 
formed a test calculation in just over three 
minutes that would take a supercomputer running
at full capacity much longer (F. Arute et al.
Nature 574, 505-510; 2019). To really make a 
practical quantum computer, though, will take 
thousands or millions of quantum bits, and 
topological materials could create the tech- 
nology to turn that into reality. 

Additive manufacturing, also known as 3D 
printing, is another field that’s taken hold in the
past few years thanks to materials advances. 
When it was first developed, 3D printing was 
used mainly to make prototypes of various 
tools by having a laser transform powders into 
a solid, or extruding a thermal plastic through
a nozzle. Researchers have since learnt how 
to work with a variety of materials, including 
metals and conductive plastics, to make parts 
for products. Boeing, for instance, is pursuing 
additive manufacturing as a more cost-effec- 
tive way to make plane parts, and has invested 
in 3D printing companies. In 2018, the US avi- 
ation giant put money into Digital Alloys, a 
Massachusetts-based company with a tech- 
nology that allows it to print different types of 
metals together to enhance their mechanical 
or thermal properties.

The multinational manufacturing conglomerate,
3M, which makes everything from
transparent bandages to Post-It notes, has great 
interest in emerging materials innovations, says 
Greg Anderson, vice-president of its corporate 
research laboratory in Maplewood, Minnesota. 
But it usually asks whether an innovation can 
lead to a new platform, rather than a single 
product: more a 3D printing technology that
can lead to a whole series of products, rather
than the one widget made by 3D printing. 

Despite the great potential for innovative 
products, there is significant risk that new 
materials discoveries will never amount to 
something marketable. Because of this, many 
companies like to support their development
in small steps, by sponsoring academic
research and funding university spinoffs, to 
see whether the technologies that emerge are 
worth more significant investment, Anderson 
explains. “We'll do sponsored research to help 
us mitigate the risk,” he says. “We'll spend a
little bit of money in partnership with the uni- 
versity to advance the science and see how 
feasible that technology is.” 

“In academia, we can dream something 
really big. What we do could be science fiction,” 
says Bao. “But some of those [ideas] could be 
turned into reality. That depends on the company
and their creative thinking of how to use
some of the crazy things that we invent.” 


Neil Savage is a freelance science writer in 
Lowell, Massachusetts.


Wearing well


Stretch goals of a long-time partnership 
yield flexible electronics breakthroughs. 


Stanford University’s Zhenan Bao is taking 
wearable electronics to a radical new level 
inspired by skin — flexible, stretchable 
and unobtrusive. In August 2019, she and 
her team detailed a wearable sensor in
the form of a sticker. It measures heart
and breathing rates, and transmits this
information via a flexible antenna and
a radiofrequency identification tag to a
receiver clipped to the wearer's clothes.

The device, described in Nature 
Electronics (S. Niu et al. Nature Electron. 2, 
361-368; 2019), is the latest milestone in 
Bao’s long-running collaboration with the 
South Korean conglomerate, Samsung, and 
its in-house lab, the Samsung Advanced 
Institute of Technology in Suwon. 

Bao’s team had been working with 
Samsung on carbon nanotubes — 
cylindrical sheets of carbon atoms — for 
a decade when she pitched an idea for 
stretchable electronics in 2014. These were 
to be more than just rigid components with
flexible joints; they were conceptualized as
skin-like, intrinsically stretchable materials. 
This was a continuation of the previous 
work, as carbon nanotubes could be 
incorporated into such materials. 

Samsung signed on. Its work with 
Bao and others has become a mainstay 
of the most productive international 
corporate-academic collaboration in the 
Nature Index. Bao describes the financial 
support as “substantial”. For decades 
Samsung has funded research at South 
Korean universities, and it acquired one of 
the nation’s leading institutions outright, 
Sungkyunkwan University, which has 
campuses in Seoul and Suwon. Samsung 
sponsors research worldwide through its 
Global Research Outreach programme, 
open to all researchers. 

In the early stages of the collaboration 
on stretchables, Samsung set performance 
targets for the electrical conductivity of the 
material that would be required to design 
viable products. Bao admits that she 
initially thought they were impossible, “but 
this kind of impossible target can make us 
more creative”, she says. 


Mark Zastrow


People of substance


These early-career scientists are 
ushering in a world of new materials. 


The five researchers profiled here are
emerging as star competitors in a
highly competitive field. Whether
building better batteries or unlocking
the brain’s mysteries, they are deploying
novel materials in novel ways to address
pressing problems.

These three women and two men stood out 
from more than 10,000 researchers identified 
in the League of Scholars database as having 
five or more publications since 2015 in any 
of the following six materials-science-spe- 
cific journals tracked by the Nature Index: 
ACS Nano, Advanced Functional Materials, 
Advanced Materials, Nano Letters, Nature 
Materials and Nature Nanotechnology. 

Journals were selected for inclusion in the 
Nature Index based on perceptions of quality 
by independent panels of expert researchers. 
To be considered for selection, the materials 
scientists featured here had to be early in their 
careers, in this instance meaning it is less than 
five years since their first article citation. They 
have a strong track record of year-on-year pub- 
lication and all have an outstandingly high 
annualized h-index, a metric which recognizes 
both the output and impact (using citations as 
a proxy for impact) across all journals while 
normalizing for differences in disciplines and 
career stages. 

The citations data shown are drawn from 
the Dimensions for Digital Science database. 


Lighting up the brain 
Blanca del Rosal 
Swinburne University of Technology 


From a medical perspective, the greatest thing
about synthetic nanomaterials, specifically 
nanoparticles, is their small size. Because 
they are on the same scale as human cell parts,
they are useful as minute probes for spying on 
cellular machinery without causing too much 
disruption. Blanca del Rosal is developing a 
technology to do just that, using nanoparti- 
cle beams to unveil the inner workings of the 
brain. With her team at Swinburne University 
of Technology’s Centre for Micro-Photonics in
Melbourne, Australia, she is figuring out how
luminescent nanoparticles can track temper- 
ature changes during the treatment of brain 
tumours, which could lead to safer approaches. 

Her research could also provide a minimally 
invasive way to observe how the brain behaves 
during strokes and comas. “The usual way to 
do it is by sticking a physical probe into the 
brain,” says del Rosal.

In 2014, del Rosal tested the ability of
luminescent nanoparticles to treat cancer 
in mice as a PhD student at the Autonomous 
University of Madrid in Spain. A nanoparticle 
solution injected into the tumours was heated 
with a laser to around 20% higher than nor- 
mal body temperature. The heat generated 
by the irradiated nanoparticles reduced the 
size of the tumours, and the team used the 
fluorescence to monitor changes in tempera- 
ture. In 20 days, the tumours in the mice were 
eradicated (E. Carrasco et al. Adv. Funct. Mater. 
25, 615-626; 2015). “It was the first time that 
anyone had used this contactless method for 
sensing temperature below the skin in animal 
models,” says del Rosal. 

Now a postdoctoral research fellow at Swinburne,
del Rosal is investigating how to apply
this new temperature-sensing approach to 
mapping brain activity in real time, with the 
hope of developing a non-invasive method of 
observing how the brain responds to conditions
such as epileptic seizures, brain injuries
and strokes. 

She is also interested in how light-emit- 
ting nanoparticles can be used to assess 
the efficacy and safety of laser-based therapies
used for restoring vision and treating
chronic pain. “There’s still a lot we don’t know
about the brain, but light can give us more 
insight into how it works,” says del Rosal. 
Gemma Conroy 


Stabilizing influence 
Emily Hitz 
University of Maryland 


An important aim for NASA is avoiding explosions
during Moon missions. That’s why Emily
Hitz is investigating the lunar applications of 
solid state batteries as part of her three-year 
fellowship with the US space organization. 




Blanca del Rosal is using nanoparticle beams to peer inside the brain. 


Lithium-ion batteries, one of the most 
common types of rechargeable batteries 
on the market, use flammable liquid or gel 
electrolytes. Solid-state batteries, which use
solid electrodes and electrolytes made from
ceramic, polymer or glass, offer far greater 
stability. “NASA is interested in having very 
safe batteries,” says Hitz. 

Solid-state batteries are attracting interest 
as an emerging technology because of their 
safety, the large amount of power they can dis- 
pense per unit of battery size, and their long 
cycle life, meaning they can complete a high 
number of charge/discharge cycles at optimal 
capacity. 

Based in the Department of Materials 
Science and Engineering at the University 
of Maryland (UMD), Hitz is interested in 
how new energy technologies can become
more accessible by incorporating cheap and
easy-to-manufacture resources. “One day we 
might literally be using a block of wood to 
make a battery,” she says. “We use whatever 
we can to make it work. I like that flexibility.” 

Having graduated from UMD with an elec- 
trical engineering degree in 2015, Hitz made 
the transition to materials science for her 
PhD. She wanted the opportunity to discover 
highly desirable material properties by hav- 
ing a stronger chemistry focus in her work. 
“I wanted to actually change the materials I 
was working with; to change their properties,” 
she says. 

Hitz’s ultimate goal is to see new kinds of 
commercially viable batteries become part of 
a more diverse landscape of renewable energy
technologies. “You can’t just sit in the lab and let
the technology die there,” she says. Bec Crew 


Solar cell optimizer 
Juan-Pablo Correa-Baena 
Georgia Institute of Technology 


For materials scientist Juan-Pablo Correa-Baena,
the key to addressing the world’s
growing energy demands is understanding 
solar cells at the nanoscale. At the Georgia 
Institute of Technology, he is uncovering the 
chemical reactions that underpin the perfor- 
mance of perovskite solar cells. They are cheap 
to manufacture and have shown remarkable 
increases in efficiency over the past decade, 
achieving a higher energy output than commercially
available silicon solar cells.

Lead halide perovskites can convert sunlight
into electrical energy, making them ideal
materials for solar cells. Researchers have been 
experimenting with different ways to assemble 
perovskite solar cells to improve their stability 
and efficiency. 

A major limitation of perovskite solar cells is
that they don’t hold up well against moisture 
or extended periods of light or heat, so Cor- 
rea-Baena is working on techniques to create 
a protective outer layer that doesn’t compromise
the cell’s performance.

During his postdoc at the Massachusetts
Institute of Technology (MIT) in 2018, Correa-Baena
used the powerful light beams generated
by synchrotron particle accelerators to
probe the nanostructure of perovskite solar 
cells. He found that adding low concentra- 
tions of caesium and rubidium to perovskite 
solar cells can boost their performance. “It 
was one of the biggest discoveries I’ve had,” 
says Correa-Baena. “It opened up this whole 
new field of things that we could explore using 
synchrotrons.” 

Improving access to power for rural commu- 
nities in Correa-Baena’s home country, Colom- 
bia, is the driving force behind his research, 
he says. “There’s much more that we can do 
in research to try to push forward topics that 
are important to the world.” Gemma Conroy 


Flexible thinking 
Huang Yang 
Shenzhen University 


Huang Yang, an assistant professor of mate- 
rials science at Shenzhen University in China, 
has emerged as a high-impact researcher 
through his work with supercapacitors, an 
alternative to batteries that could provide 
energy storage solutions for wearable and
bendable technologies. Supercapacitors are
ideal in applications requiring many rapid 
charge and discharge cycles, such as electric
cars, and could be the key to advancing fold- 
able smartphones and television screens. In 
2019, Motorola, Samsung and Huawei intro- 
duced bendable smartphones to the market, 
but although their screens can fold like a book, 
their lithium-ion batteries are not flexible. The 
current fix is to place either two electrodes of 
one battery, or two or more separate batteries, 
at either end of the screen, but this comes at a
cost to the overall power of the device. 

The ability to withstand bending, or ‘deformation’,
is a major goal for energy storage technology.
Huang is developing a flexible ‘shape
memory’ supercapacitor that can be stretched 
and deformed. A heat trigger is then used to 
restore its shape and stability. The technology,
he says, would be ideal for powering wearable
tech, as it can be sewn into fabric.

Huang Yang makes ‘shape memory’ supercapacitors, an alternative to batteries.

Huang and his colleagues take a multi- 
disciplinary approach to developing super- 
capacitors. He says the unconventional way 
that they test the limits of the technology is 
revealing some promising new directions for 
flexible devices. “We’re combining organic 
and inorganic material, trying methods both 
at the nanoscale and macro-scale, and we’re 
attempting new and traditional materials,” he 
says. Hepeng Jia 


Taking charge 
Rachel Carter 
US Naval Research Laboratory 


An invention that won three researchers the 
2019 Nobel Prize in Chemistry, lithium-ion bat- 
teries launched the portable electronics indus- 
try and are expected to play a crucial role in 
future renewable energy storage. Scientists are 
now working to solve problems relating to their
safety, performance and the amount of waste 
they produce. In Australia, 2% of lithium-ion
batteries are recycled; in the United States and
Europe, the rate is less than 5%.

[Chart: citations by publication year, 2015-2019]

Rachel Carter, a research engineer at the 
United States Naval Research Laboratory 
(NRL) in Washington, is investigating ways 
to expand the lifespan of lithium-ion batter- 
ies, while also pursuing alternatives, such as 
lithium sulfur batteries and sodium sulfur 
batteries. 

The biggest challenge with rechargeable 
batteries, says Carter, is that the chemical reac- 
tion that generates power isn’t 100% reversi- 
ble — meaning the battery loses capacity with 
every charge. “Inevitably, they will degrade,” 
says Carter. 
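Carter's point about imperfect reversibility can be illustrated with a simple geometric model: if a cell keeps a fixed fraction of its capacity after each cycle, the loss compounds. The sketch below is a toy model only. The per-cycle retention of 99.95% and the 3,000 mAh starting capacity are hypothetical values chosen for illustration, not figures from the article or from Carter's work, and real cells do not fade at a constant rate.

```python
def remaining_capacity(initial_capacity: float,
                       retention_per_cycle: float,
                       cycles: int) -> float:
    """Capacity left after a number of charge/discharge cycles,
    assuming the same fraction is retained on every cycle."""
    return initial_capacity * retention_per_cycle ** cycles


# Hypothetical cell: 3,000 mAh, retaining 99.95% of its capacity per cycle.
initial_mah = 3000.0
retention = 0.9995

for n in (100, 500, 1000):
    left = remaining_capacity(initial_mah, retention, n)
    print(f"after {n:4d} cycles: {left:7.1f} mAh ({left / initial_mah:.0%})")
```

Even a loss of a twentieth of a percent per cycle removes more than a fifth of the capacity within 500 cycles in this model, which is why small gains in reversibility matter.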

“Our focus is on developing materials that 
produce high energy and high reversibility, 
but also to understand their behaviours from 
a materials science perspective, such as what 
happens when they're heated or cooled.” 

With her colleague, Corey Love, a materials 
research engineer at the NRL, Carter is finding 
ways to detect temperature variations, a common
cause of dangerous leaks in lithium-ion
batteries, without opening them. 

“Using this technique to determine when 
thermal imbalances occur, we can detect 
unsafe cell conditions,” says Carter. 

She is passionate about working with other 
young researchers on energy storage solu- 
tions. She is also leading initiatives within 
the NRL to foster professional development 
for female scientists. “I’m looking forward to 
seeing what happens with these battery chem- 
istries, and collaborating with up-and-coming 
scientists is an exciting part of that,” she says. 
Bec Crew


Scaling up


Chinese institutions monopolize the fastest-rising ranks 
for materials science output in the Nature Index. 
Data analysis by Bo Wu. Infographic by Denis Mallet. 


TOP 10 RISING INSTITUTIONS 
IN MATERIALS SCIENCE — GLOBAL 


Institutions are ranked by the increase in their Nature Index Share in materials science between 2015 
(inner circle) and 2018 (outer circle). The absolute change in their Share over the period is also shown.

Legend

Institutions are ranked from left to right by the 
absolute increase in their Nature Index Share for 
materials science research in journals tracked by 
the index, 2015-2018. Circles are sized by an 
institution’s Share in the index, which takes into 
account the proportion of their affiliated authors’ 
contributions to articles published in the Nature 
Index journals. The absolute change in their Share 
over the period is also shown.

RISING INSTITUTIONS IN MATERIALS
SCIENCE TOP 50 OUTSIDE CHINA 


Just 7 institutions from outside China were ranked among the 50 institutions 
worldwide with the fastest-rising output in materials science 2015-2018.

GLOBAL SHIFT


As institutions compete to author the finite number 
of articles which can appear in the 82 high-quality 
journals tracked by the Nature Index, the ascendancy 
of Chinese institutions translates to declines among 
some international competitors.

WEIGHT OF MATERIALS


Materials science Share as a proportion of total 
Share in the Nature Index is shown for the top ten 
countries, ranked left to right by their 2018 
materials science Share.

Cotton fibres can become conduits for ions, atoms or molecules, and generate electricity.


Energy harvesters 
pick up power 


New technologies are channelling incidental 
energy into practical uses. By Mark Zastrow 


Wood seems an unlikely material
for the forefront of an energy
revolution. But materials scientist
Tian Li believes it could unlock
electricity from a source of energy
that is both sustainable and ubiquitous: our
own body heat.

The key lies in the remarkable fibrous
structure of cellulose, the main component 
of wood, paper and cotton. These nanoscale 
fibres are “a beautiful structure” and a marvel
of natural molecular engineering, says Li, a 
postdoctoral researcher at the University of 
Maryland. When treated with a chemical bath,
the fibres can become conduits for ions, atoms
or molecules with a net electrical charge. By 
exploiting how ions move in the presence of 
heat, they can generate electricity, even from 
small temperature differences, recovering 
energy that would otherwise be lost to the 
atmosphere. 

This is just one way that researchers and 
companies are trying to scavenge energy from 
our ambient environment in its many forms, 
including radio waves, vibrations and sound. 
The global ‘energy harvesting’ industry is 
worth around US$500 million, and growing by 
roughly 10% each year, according to industry 
reports. Its development is seen as crucial to 
the expansion of the Internet of Things (IoT), 
a vision in which our homes, offices and infrastructure
are made ‘smart’ with vast numbers
of networked sensors. Proponents also say it 
can contribute to a greener world. “We need 
as many technologies as possible to generate 
sustainable energy,” says Li.


Wooden heat 


Li and her colleagues at the University of Maryland
developed a process for soaking slabs
of common American basswood in a chemical
bath of sodium hydroxide (T. Li et al. Nature
Mater. 18, 608-613; 2019). This extracts the
lignin, a natural polymer intertwined around
the cellulose fibres, which gives wood its brownish
hue. The treatment also breaks the hydrogen
bonds along the remaining cellulose nanofibres
and forms a crystalline structure with highly 
aligned molecular strands, along which sodium 
and hydroxide ions can freely travel. 

Li fabricated similar materials when she 
studied semiconductors in graduate school, 
but this process is much simpler. “I recall hav- 
ing to go to the fabrication lab for a week to 
make sure all the channels were aligned and 
perfect,” she recalls. “Then, when I did my post- 
doc, I was looking at this material [wood] and 
was amazed. It had perfectly aligned channels.” 

When placed against a heat source, the 
nanostructure of the cellulose allows the 
sodium ions to drift away from the source 
more easily than the hydrogen ions, creating 
an electric potential that can later be tapped 
to release electricity. This treated cellulose 
can harvest heat energy from sources just 5 °C 
warmer than its surroundings. 

Li envisions shirts made of similarly treated
cotton, which is nearly pure cellulose, that
could harvest body heat, storing it in a battery
to recharge phones, or charging them wirelessly
in pockets. “Previously, when people
talked about wearable flexible devices, they put
this device on top of a substrate,” she says. But
in this case, a separate surface is not needed.
“Your cotton t-shirt can be a device itself.”

Wood could also potentially tap into other
industrial sources. It is estimated that the 
global amount of low-grade wasted heat 
from industrial processes and other sources 
is enough to power more than 6,000 typical 
one-gigawatt nuclear reactors. 


Sensing an opportunity 


Over the past decade, the potential of energy 
harvesting has been alternately hyped and 
challenged. Some applications have been 
stymied by physics: absorbing a sufficient 
amount of energy requires devices that are 
too large to fit in small sensors that can use 
cheaper batteries instead. 

“The key is coupling the energy harvester 
to a low-power processor or radio,” says Mike
Demler, a semiconductor technology analyst
with the Linley Group, a Californian consulting 
firm. 

Many people view the sensors in IoT devices
as a key application for energy harvesting. 
“Sensors usually consume very little power,” 
says Kim Dong In, an electrical engineer at 
Sungkyunkwan University in Suwon, South 
Korea. The problem with IoT sensors, he says, 
is the vast number of them that need to be 
deployed, and the challenge of recharging 
them or replacing their batteries. Energy har- 
vesting is a natural fit for powering sensors that 
are not easily accessible: behind the walls of a
smart home, for instance, or embedded in the
structure of a bridge to monitor its integrity. 
Kim sees these as perfect environments for 
devices that can recharge themselves by har- 
vesting energy from radio waves. “We cannot 
see them, but a lot of energy is available from 
surrounding electromagnetic signals such as 
TV, wifi and cellular phone towers,” he says. 

In such rechargeable devices, an antenna 
picks up radio waves, which induce an alternating
current. This is passed along to a diode,
which converts the signal into a direct current 
that can charge a capacitor. The challenge is 
that the effective charging range is only a 
few metres and diodes are not very efficient. 
For example, the signal strength needed to 
generate useful power far exceeds the signal 
strength a wifi router needs to maintain a 
strong connection. 
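The short charging range described above follows from how quickly radio power spreads out with distance. As a rough sketch, the standard free-space (Friis) transmission formula, which is not mentioned in the article and is brought in here only for illustration, gives the power arriving at an antenna. The 1 W transmitter, the 2.4 GHz frequency and the unity antenna gains are hypothetical values, and losses in the diode are ignored, so real harvested power would be lower still.

```python
import math

SPEED_OF_LIGHT = 299_792_458.0  # metres per second


def received_power_w(tx_power_w: float, freq_hz: float, distance_m: float,
                     tx_gain: float = 1.0, rx_gain: float = 1.0) -> float:
    """Idealized free-space received power from the Friis formula:
    P_r = P_t * G_t * G_r * (wavelength / (4 * pi * d)) ** 2
    """
    wavelength_m = SPEED_OF_LIGHT / freq_hz
    return tx_power_w * tx_gain * rx_gain * (wavelength_m / (4 * math.pi * distance_m)) ** 2


# Hypothetical source: 1 W at 2.4 GHz with unity-gain antennas.
for distance in (1.0, 2.0, 5.0, 10.0):
    power_uw = received_power_w(1.0, 2.4e9, distance) * 1e6
    print(f"{distance:5.1f} m: {power_uw:8.2f} microwatts")
```

Received power falls with the square of the distance in this model, so doubling the separation cuts the available power to a quarter, before any conversion losses are counted.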

Kim proposes a solution: using a separate 
base station designed purely to transmit power, 
not information. In a smart home, this could be
incorporated into a robotic vacuum cleaner 
that roams the house, recharging sensors as 
it goes. For a bridge, a drone equipped with a
power transmitter could fly down its length 
to top up embedded sensors. This, Kim says, 
makes radio-frequency harvesting one of the 
more flexible energy harvesting techniques. 

The concept could be taken even further by 


An illuminated installation that takes its energy from the footsteps of people passing by. 


the use of wallpaper with carefully designed 
metasurfaces — surfaces with nanoscale struc- 
tures made of individual units, akin to pixels, 
that can manipulate the wavefronts of light. By 
applying a small voltage to each unit, researchers 
can create an artificial lens, focusing and 
steering beams of radio waves to their targets. 

“We need as many 
technologies as possible 
to generate sustainable 
energy.” 

Some energy harvesting techniques have 
more proven track records. Wearable devices 
that harvest energy have been around for centuries: 
automatic watches, which wind themselves 
using an oscillating weight as the user 
moves them, date back to the 1770s. The first 
electronic device to harvest its own energy in 
this way was also a watch, patented in 1988 by 
Seiko. Solar cells have powered watches and 
calculators for decades, and are frequently 
used in IoT sensors, says Demler. 

Another relatively well-established energy 
harvesting technique is based on piezoelec- 
trics, materials that generate electric charge 
when squeezed or pressed. In 1998, MIT 
researchers were some of the first to experiment 
with embedding them in shoes, a 
technique that garnered the interest of the US 
military. By 2008, Japan was experimenting 
with embedding piezoelectrics in the floors 
of subway stations to power turnstiles. 

Work continues on a variety of experi- 
mental techniques. One promising class of 
devices works by generating triboelectricity, 
also known as static electricity. These small 
devices, called triboelectric nanogenerators, 
use friction between their moving parts, such 
as two sheets of flexible polymers or a silicone 
rubber ball that bounces around a hollow 
plastic sphere. The first such devices were 
developed by a team at the Georgia Institute 
of Technology in Atlanta. 

Ultimately, successful implementation of 
energy harvesting comes down to tailoring a 
technique for a specific use case, says Demler. 
Further advances in computer chips that can 
use lower amounts of power will help, too. As 
for Li’s nano-engineered wood, she says she’s 
yet to start work on commercializing it, but 
plans to once she lands a faculty position. “It’s 
such an open field,” she says. “And I feel this is 
truly a field worth exploring.” 


Mark Zastrow is a science writer in Seoul, 
South Korea. 